NOVEL NUCLEIC ACIDS AND POLYPEPTIDES

1. TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such
polynucleotides, along with uses for these polynucleotides and proteins, for example in
therapeutic, diagnostic and research methods.

2. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past
decade. The now routine hybridization cloning and expression cloning techniques clone novel
polynucleotides "directly" in the sense that they rely on information directly related to the
discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of
hybridization cloning; activity of the protein in the case of expression cloning). More recent
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences
based on the presence of a now well-recognized secretory leader sequence motif, as well as
various PCR-based or low stringency hybridization-based cloning techniques, have advanced the
state of the art by making available large numbers of DNA/amino acid sequences for proteins
that are known to have biological activity, for example, by virtue of their secreted nature in the
case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based
techniques, or by virtue of structural similarity to other genes of known biological activity.

Identified polynucleotide and polypeptide sequences have numerous applications in, for
example, diagnostics, forensics, gene mapping, identification of mutations responsible for
genetic disorders or other traits, assessment of biodiversity, and production of many other types
of data and products dependent on DNA and amino acid sequences.

3. SUMMARY OF THE INVENTION

The compositions of the present invention include novel isolated polypeptides, novel
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules,
cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more
epitopes present on such polypeptides, as well as hybridomas producing such antibodies.

The compositions of the present invention additionally include vectors, including expression
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such
polynucleotides and cells genetically engineered to express such polynucleotides.

The present invention relates to a collection or library of at least one novel nucleic acid
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by
hybridization (SBH), and in some cases, sequences obtained from one or more public databases.
The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic,
diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid
sequences are designated as SEQ ID NO: 1-1350. The polypeptide sequences are designated SEQ
ID NO: 1351-2700. The nucleic acids and polypeptides are provided in the Sequence Listing. In
the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is guanine; T is
thymine; and N is any of the four bases. In the amino acids provided in the Sequence Listing, *
corresponds to the stop codon.

The nucleic acid sequences of the present invention also include nucleic acid sequences that
hybridize to the complement of SEQ ID NO: 1-1350 under stringent hybridization conditions;
nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid
sequences recited above; or nucleic acid sequences that encode a peptide comprising a specific
domain or truncation of the peptides encoded by SEQ ID NO: 1-1350. Also included is a
polynucleotide comprising a nucleotide sequence having at least 90% identity to an identifying
sequence of SEQ ID NO: 1-1350 or a degenerate variant or fragment thereof. The identifying
sequence can be 100 base pairs in length.
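
The 90% identity threshold can be illustrated with a minimal sketch. It assumes a simple ungapped, position-by-position comparison of two equal-length sequences; the text above does not specify how identity is computed, and practical comparisons normally rely on an alignment program. The function name and the toy sequences are hypothetical.

```python
def percent_identity(candidate, reference):
    """Percentage of positions at which two equal-length sequences agree.

    Assumption: ungapped comparison; 'N' is treated as an ordinary symbol.
    """
    candidate, reference = candidate.upper(), reference.upper()
    if len(candidate) != len(reference):
        raise ValueError("ungapped comparison needs equal-length sequences")
    if not reference:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for a, b in zip(candidate, reference) if a == b)
    return 100.0 * matches / len(reference)


# Toy 100-base identifying sequence (not taken from the Sequence Listing).
reference = "ACGT" * 25

# Change every tenth base so that exactly 10 of 100 positions differ.
swap = {"A": "C", "C": "G", "G": "T", "T": "A"}
bases = list(reference)
for i in range(0, 100, 10):
    bases[i] = swap[bases[i]]
candidate = "".join(bases)

identity = percent_identity(candidate, reference)
meets_threshold = identity >= 90.0
```

With 10 mismatches over a 100-base identifying sequence, the candidate sits exactly at the 90% boundary.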

The nucleic acid sequences of the present invention also include the sequence information
from the nucleic acid sequences of SEQ ID NO: 1-1350. The sequence information can be a
segment of any one of SEQ ID NO: 1-1350 that uniquely identifies or represents the sequence
information of SEQ ID NO: 1-1350.

A collection as used in this application can be a collection of only one polynucleotide. The
collection of sequence information or identifying information of each sequence can be provided on
a nucleic acid array. In one embodiment, segments of sequence information are provided on a
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed
to detect full-match or mismatch to the polynucleotide that contains the segment. The collection
can also be provided in a computer-readable format.

This invention also includes the reverse or direct complement of any of the nucleic acid
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and
host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their
reverse or direct complements) according to the invention have numerous applications in a variety
of techniques known to those skilled in the art of molecular biology, such as use as hybridization
probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing
full-length genes, use for chromosome and gene mapping, use in the recombinant production of
protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like.

In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-1350 or novel
segments or parts of the nucleic acids of the invention are used as primers in expression assays that
are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of
SEQ ID NO: 1-1350 or novel segments or parts of the nucleic acids provided herein are used in
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath
et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human
genome.

The isolated polynucleotides of the invention include, but are not limited to, a
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-1350; a
polynucleotide comprising any of the full length protein coding sequences of SEQ ID NO: 1-1350;
and a polynucleotide comprising any of the nucleotide sequences of the mature protein coding
sequences of SEQ ID NO: 1-1350. The polynucleotides of the present invention also include, but
are not limited to, a polynucleotide that hybridizes under stringent hybridization conditions to (a)
the complement of any one of the nucleotide sequences set forth in SEQ ID NO: 1-1350; (b) a
nucleotide sequence encoding any one of the amino acid sequences set forth in the Sequence Listing
(e.g., SEQ ID NO: 1351-2700); (c) a polynucleotide which is an allelic variant of any
polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g.
orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide
comprising a specific domain or truncation of any of the polypeptides comprising an amino acid
sequence set forth in the Sequence Listing.

The isolated polypeptides of the invention include, but are not limited to, a polypeptide
comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding
full length or mature protein. Polypeptides of the invention also include polypeptides with
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence
set forth in SEQ ID NO: 1-1350; or (b) polynucleotides that hybridize to the complement of the
polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial
equivalents" thereof (e.g., with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%
amino acid sequence identity) that preferably retain biological activity are also contemplated. The
polypeptides of the invention may be wholly or partially chemically synthesized but are preferably
produced by recombinant means using the genetically engineered cells (e.g. host cells) of the
invention.

The invention also provides compositions comprising a polypeptide of the invention.
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a
hydrophilic, e.g., pharmaceutically acceptable, carrier.

The invention also provides host cells transformed or transfected with a polynucleotide of
the invention.

The invention also relates to methods for producing a polypeptide of the invention
comprising growing a culture of the host cells of the invention in a suitable culture medium
under conditions permitting expression of the desired polypeptide, and purifying the polypeptide
from the culture or from the host cells. Preferred embodiments include those in which the
protein produced by such process is a mature form of the protein.

Polynucleotides according to the invention have numerous applications in a variety of
techniques known to those skilled in the art of molecular biology. These techniques include use
as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is
largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample
using, e.g., in situ hybridization.

In other exemplary embodiments, the polynucleotides are used in diagnostics as
expressed sequence tags for identifying expressed genes or, as well known in the art and
exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for
physical mapping of the human genome.

The polypeptides according to the invention can be used in a variety of conventional
procedures and methods that are currently applied to other proteins. For example, a polypeptide
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight
markers, and as a food supplement.

Methods are also provided for preventing, treating, or ameliorating a medical condition
which comprises the step of administering to a mammalian subject a therapeutically effective
amount of a composition comprising a polypeptide of the present invention and a
pharmaceutically acceptable carrier.

In particular, the polypeptides and polynucleotides of the invention can be utilized, for
example, in methods for the prevention and/or treatment of disorders involving aberrant protein
expression or biological activity.

The present invention further relates to methods for detecting the presence of the
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the
identification of subjects exhibiting a predisposition to such conditions. The invention provides
a method for detecting the polynucleotides of the invention in a sample, comprising contacting
the sample with a compound that binds to and forms a complex with the polynucleotide of
interest for a period sufficient to form the complex and under conditions sufficient to form a
complex and detecting the complex such that if a complex is detected, the polynucleotide of
interest is detected. The invention also provides a method for detecting the polypeptides of the
invention in a sample comprising contacting the sample with a compound that binds to and forms
a complex with the polypeptide under conditions and for a period sufficient to form the complex
and detecting the formation of the complex such that if a complex is formed, the polypeptide is
detected.

The invention also provides kits comprising polynucleotide probes and/or monoclonal
antibodies, and optionally quantitative standards, for carrying out methods of the invention.
Furthermore, the invention provides methods for evaluating the efficacy of drugs, and
monitoring the progress of patients, involved in clinical trials for the treatment of disorders as
recited above.

The invention also provides methods for the identification of compounds that modulate
(i.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides
of the invention. Such methods can be utilized, for example, for the identification of compounds
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are
not limited to, assays for identifying compounds and other substances that interact with (e.g.,
bind to) the polypeptides of the invention. The invention provides a method for identifying a
compound that binds to the polypeptides of the invention comprising contacting the compound
with a polypeptide of the invention in a cell for a time sufficient to form a
polypeptide/compound complex, wherein the complex drives expression of a reporter gene
sequence in the cell; and detecting the complex by detecting the reporter gene sequence
expression such that if expression of the reporter gene is detected the compound that binds to a
polypeptide of the invention is identified.

The invention also provides methods for treatment which involve the
administration of the polynucleotides or polypeptides of the invention to individuals exhibiting
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or
disorders as recited herein comprising administering compounds and other substances that
modulate the overall activity of the target gene products. Compounds and other substances can
effect such modulation either on the level of target gene/protein expression or target protein
activity.

The polypeptides of the present invention and the polynucleotides encoding them are also
useful for the same functions known to one of skill in the art as the polypeptides and
polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth
for a sequence, then the polypeptides and polynucleotides of the present invention are useful for
a variety of applications, as described herein, including use in arrays for detection.

4. DETAILED DESCRIPTION OF THE INVENTION

4.1 DEFINITIONS

It must be noted that as used herein and in the appended claims, the singular forms "a",
"an" and "the" include plural references unless the context clearly dictates otherwise.

The term "active" refers to those forms of the polypeptide which retain the biologic
and/or immunologic activities of any naturally occurring polypeptide. According to the
invention, the terms "biologically active" or "biological activity" refer to a protein or peptide
having structural, regulatory or biochemical functions of a naturally occurring molecule.
Likewise "immunologically active" or "immunological activity" refers to the capability of the
natural, recombinant or synthetic polypeptide to induce a specific immune response in
appropriate animals or cells and to bind with specific antibodies.

The term "activated cells" as used in this application refers to those cells which are engaged in
extracellular or intracellular membrane trafficking, including the export of secretory or
enzymatic molecules as part of a normal or disease process.

The terms "complementary" or "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the
complementary sequence 3'-TCA-5'. Complementarity between two single-stranded molecules
may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that
total complementarity exists between the single stranded molecules. The degree of
complementarity between the nucleic acid strands has significant effects on the efficiency and
strength of the hybridization between the nucleic acid strands.
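
The base-pairing rule in this definition can be sketched in a few lines. This is an illustration only, assuming standard Watson-Crick DNA pairing (A with T, G with C); the function names are hypothetical and the measure of "partial" complementarity shown is simply the fraction of aligned positions that pair.

```python
# Watson-Crick DNA pairing (assumed for illustration).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq_5_to_3):
    """Complementary strand, written 3'->5' so it lines up with the input."""
    return "".join(PAIRS[base] for base in seq_5_to_3.upper())


def reverse_complement(seq_5_to_3):
    """Complementary strand rewritten in the conventional 5'->3' direction."""
    return complement(seq_5_to_3)[::-1]


def fraction_paired(strand_5_to_3, other_3_to_5):
    """Fraction of aligned positions that base-pair (1.0 means complete)."""
    if len(strand_5_to_3) != len(other_3_to_5):
        raise ValueError("strands must be the same length")
    if not strand_5_to_3:
        raise ValueError("strands must be non-empty")
    paired = sum(
        1
        for a, b in zip(strand_5_to_3.upper(), other_3_to_5.upper())
        if PAIRS.get(a) == b
    )
    return paired / len(strand_5_to_3)


complete = fraction_paired("AGT", "TCA")  # the example from the text
partial = fraction_paired("AGT", "TCG")   # last position does not pair
```

Here `complement("AGT")` gives the 3'-TCA-5' strand of the example, and the second comparison shows a partially complementary pair.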

The term "embryonic stem cells (ES)" refers to a cell that can give rise to many
differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady
and continuous source of germ cells for the production of gametes. The term "primordial germ
cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to
differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells
are derived. The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells
not only populate the germ line and give rise to a plurality of terminally differentiated cells that
comprise the adult specialized organs, but are able to regenerate themselves.

The term "expression modulating fragment," EMF, means a series of nucleotides which
modulates the expression of an operably linked ORF or another EMF.

As used herein, a sequence is said to "modulate the expression of an operably linked
sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs
include, but are not limited to, promoters, and promoter modulating sequences (inducible
elements). One class of EMFs are nucleic acid fragments which induce the expression of an
operably linked ORF in response to a specific regulatory factor or physiological event.

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or
"oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or the
sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic
origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T
(U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic
acid which is capable of being expressed in a recombinant transcriptional unit comprising
regulatory elements derived from a microbial or viral operon, or a eukaryotic gene.
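
The convention that T is replaced by U when a listed sequence is read as RNA can be sketched as follows. The function name and the input check are illustrative assumptions; only the alphabet A, C, G, T and N and the T-to-U substitution come from the definition above.

```python
DNA_ALPHABET = set("ACGTN")  # symbols used in the sequences, per the definition


def dna_to_rna(dna):
    """Rewrite a DNA-alphabet sequence as RNA by substituting U for T."""
    dna = dna.upper()
    unexpected = set(dna) - DNA_ALPHABET
    if unexpected:
        raise ValueError(f"unexpected symbols: {sorted(unexpected)}")
    return dna.replace("T", "U")


rna = dna_to_rna("ATGCNT")
```

N is left unchanged, since it stands for any base in either alphabet.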

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide
residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides,
more preferably at least about 9 nucleotides, more preferably at least about 11 nucleotides and
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100
nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides,
preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can
be used in polymerase chain reaction (PCR), various hybridization procedures or microarray
procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A
fragment or segment may uniquely identify each polynucleotide sequence of the present
invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ
ID NOs: 1-1350.

Probes may, for example, be used to determine whether specific mRNA molecules are
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as
described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1:241-250). They may
be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the
art. Probes of the present invention, their preparation and/or labeling are elaborated in
Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their
entirety.

The nucleic acid sequences of the present invention also include the sequence
information from the nucleic acid sequences of SEQ ID NO: 1-1350. The sequence information
can be a segment of any one of SEQ ID NO: 1-1350 that uniquely identifies or represents the
sequence information of that sequence of SEQ ID NO: 1-1350. One such segment can be a
twenty-mer nucleic acid sequence because the probability that a twenty-mer is fully matched in
the human genome is 1 in 300. In the human genome, there are three billion base pairs in one set
of chromosomes. Because 4^20 possible twenty-mers exist, there are 300 times more twenty-mers
than there are base pairs in a set of human chromosomes. Using the same analysis, the
probability for a seventeen-mer to be fully matched in the human genome is approximately 1 in
5. When these segments are used in arrays for expression studies, fifteen-mer segments can be
used. The probability that the fifteen-mer is fully matched in the expressed sequences is also
approximately one in five because expressed sequences comprise less than approximately 5% of
the entire genome sequence.
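
The arithmetic behind these figures can be checked with a short sketch. It assumes, as the paragraph does, a genome of three billion base pairs, and it takes the expressed fraction to be exactly 5% (the text says "less than approximately 5%", so this is an upper-bound assumption). The ratio computed is the number of possible k-mers divided by the number of base pairs in the target, which the text reads as "1 in N". The unrounded ratios come out near 366, 5.7 and 7.2, the same order as the rounded values of 300 and "one in five" quoted above.

```python
GENOME_BP = 3_000_000_000        # "three billion base pairs" (from the text)
EXPRESSED_BP = GENOME_BP // 20   # assumption: expressed sequence is 5% of the genome


def kmers_per_base_pair(k, target_bp):
    """Number of possible k-mers (4**k) divided by the target size in base pairs."""
    return 4 ** k / target_bp


twenty_mer_genome = kmers_per_base_pair(20, GENOME_BP)        # about 366
seventeen_mer_genome = kmers_per_base_pair(17, GENOME_BP)     # about 5.7
fifteen_mer_expressed = kmers_per_base_pair(15, EXPRESSED_BP)  # about 7.2
```

The larger the ratio, the less likely it is that a given segment of that length has a chance full match elsewhere in the target.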

Similarly, when using sequence information for detecting a single mismatch, a segment can
be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome
with a single mismatch is calculated by multiplying the probability for a full match (1/4^25) times the
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an
eighteen mer with a single mismatch can be detected in an array for expression studies is
approximately one in five. The probability that a twenty-mer with a single mismatch can be
detected in a human genome is approximately one in five.
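
The single-mismatch calculation can be sketched in the same way. The per-site probability follows the rule stated above: the full-match probability 1/4^k multiplied by 3 x k, since each of the k positions can be replaced by any of three other bases. Scaling by the target size to obtain a "1 in N" figure is an assumption carried over from the full-match analysis, as are the three-billion-base-pair genome and the 5% expressed fraction. Under those assumptions the eighteen-mer and twenty-mer cases come out near 8 and 6, the same order as the "one in five" quoted in the text, while the twenty-five mer comes out near 1 in 5000.

```python
GENOME_BP = 3_000_000_000        # three billion base pairs (from the text)
EXPRESSED_BP = GENOME_BP // 20   # assumption: expressed sequence is 5% of the genome


def single_mismatch_odds(k, target_bp):
    """Return N such that a one-mismatch k-mer occurs by chance about 1 in N times.

    Per-site probability = (1 / 4**k) * (3 * k); expected chance occurrences
    in the target = per-site probability * target_bp; N is the reciprocal.
    """
    return 4 ** k / (3 * k * target_bp)


odds_25_genome = single_mismatch_odds(25, GENOME_BP)        # about 5004
odds_18_expressed = single_mismatch_odds(18, EXPRESSED_BP)  # about 8.5
odds_20_genome = single_mismatch_odds(20, GENOME_BP)        # about 6.1
```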
The term "open reading frame," ORF, means a series of nucleotide triplets coding for
amino acids without any termination codons and is a sequence translatable into protein.
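
As a minimal sketch of this definition, the following scans one reading frame and reports every maximal run of triplets that contains no termination codon. It assumes the standard genetic code stop codons (TAA, TAG, TGA) and, like the definition, does not require a start codon; the function name and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)


def stop_free_runs(seq, frame=0):
    """Return (start, end) slices of maximal stop-free triplet runs in one frame.

    Each slice seq[start:end] is a whole number of codons with no stop codon.
    """
    seq = seq.upper()
    runs = []
    run_start = frame
    pos = frame
    while pos + 3 <= len(seq):
        if seq[pos:pos + 3] in STOP_CODONS:
            if pos > run_start:
                runs.append((run_start, pos))
            run_start = pos + 3  # next run begins after the stop codon
        pos += 3
    if pos > run_start:
        runs.append((run_start, pos))
    return runs


toy = "ATGAAATAGGGGCCC"          # ATG AAA TAG GGG CCC in frame 0
frame0 = stop_free_runs(toy, 0)
frame1 = stop_free_runs(toy, 1)
```

In frame 0 the stop codon TAG splits the toy sequence into two stop-free runs; in frame 1 the sequence begins with the stop codon TGA, so a single run follows it.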

The terms "operably linked" or "operably associated" refer to functionally related nucleic
acid sequences. For example, a promoter is operably associated or operably linked with a coding
sequence if the promoter controls the transcription of the coding sequence. While operably
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic
elements e.g. repressor genes are not contiguously linked to the coding sequence but still control
transcription/translation of the coding sequence.

The term "pluripotent" refers to the capability of a cell to differentiate into a number of
differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its
differentiation capability in comparison to a totipotent cell.

The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide,
peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more
preferably at least about 9 amino acids and most preferably at least about 17 or more amino
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less
than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient
length to display biological and/or immunological activity.

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that
have not been genetically engineered and specifically contemplates various polypeptides arising
from post-translational modifications of the polypeptide including, but not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.

The term "translated protein coding portion" means a sequence which encodes for the full
length protein which may include any leader sequence or any processing sequence.

The term "mature protein coding sequence" means a sequence which encodes a peptide
or protein without a signal or leader sequence. The "mature protein portion" means that portion
of the protein which does not include a signal or leader sequence. The peptide may have been
produced by processing in the cell which removes any leader/signal sequence. The mature
protein portion may or may not include the initial methionine residue. The methionine residue
may be removed from the protein during processing in the cell. The peptide may be produced
synthetically or the protein may have been produced using a polynucleotide only encoding for
the mature protein coding sequence.

The term "derivative" refers to polypeptides chemically modified by such techniques as
ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or
substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur
in human proteins.

The term "variant" (or "analog") refers to any polypeptide differing from naturally
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e.g.,
recombinant DNA techniques. Guidance in determining which amino acid residues may be
replaced, added or deleted without abolishing activities of interest, may be found by comparing
the sequence of the particular polypeptide with that of homologous peptides and minimizing the
number of amino acid sequence changes made in regions of high homology (conserved regions)
or by replacing amino acids with consensus sequence.

Alternatively, recombinant variants encoding these same or similar polypeptides may be
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon
substitutions, such as the silent changes which produce various restriction sites, may be
introduced to optimize cloning into a plasmid or viral vector or expression in a particular
prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of
any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain
affinities, or degradation/turnover rate.
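
A silent codon change that creates a restriction site can be illustrated with a short sketch. It assumes the standard genetic code and uses a hypothetical nine-base toy sequence; the site shown, GGATCC, is the recognition sequence of the enzyme BamHI. Changing the glycine codon GGT to the synonymous codon GGA leaves the encoded peptide unchanged while introducing the site.

```python
from itertools import product

# Standard genetic code in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


BAMHI_SITE = "GGATCC"

original = "ATGGGTTCC"   # ATG GGT TCC (toy sequence, not from the Sequence Listing)
variant = "ATGGGATCC"    # ATG GGA TCC: GGT -> GGA is a synonymous change

same_peptide = translate(original) == translate(variant)
site_created = BAMHI_SITE not in original and BAMHI_SITE in variant
```

Both sequences translate to the same three-residue peptide, so the change is silent at the protein level while the variant gains a cloning site.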

Preferably, amino acid "substitutions" are the result of replacing one amino acid with
another amino acid having similar structural and/or chemical properties, i.e., conservative amino
acid replacements. "Conservative" amino acid substitutions may be made on the basis of
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and
glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10
amino acids. The variation allowed may be experimentally determined by systematically making
insertions, deletions, or substitutions of amino acids in a polypeptide molecule using
recombinant DNA techniques and assaying the resulting recombinant variants for activity.
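
The four residue classes listed above can be encoded directly, which gives a simple way to flag whether a proposed substitution stays within one class. This is a sketch only: treating "same class" as the test for a conservative substitution is an assumption made for illustration, the one-letter codes and function names are not part of the text, and the activity of any variant would still need to be determined experimentally as described.

```python
# Residue classes as listed in the text, in one-letter amino acid codes.
CLASSES = {
    "nonpolar": "ALIVPFWM",        # Ala, Leu, Ile, Val, Pro, Phe, Trp, Met
    "polar_neutral": "GSTCYNQ",    # Gly, Ser, Thr, Cys, Tyr, Asn, Gln
    "basic": "RKH",                # Arg, Lys, His
    "acidic": "DE",                # Asp, Glu
}

CLASS_OF = {aa: name for name, members in CLASSES.items() for aa in members}


def residue_class(aa):
    """Return the class name for a one-letter amino acid code."""
    try:
        return CLASS_OF[aa.upper()]
    except KeyError:
        raise ValueError(f"not one of the twenty listed amino acids: {aa!r}")


def is_conservative(original, replacement):
    """True if both residues fall in the same class (illustrative criterion)."""
    return residue_class(original) == residue_class(replacement)


leu_to_ile = is_conservative("L", "I")   # both nonpolar
lys_to_glu = is_conservative("K", "E")   # basic to acidic
```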
Alternatively, where alteration of function is desired, insertions, deletions or
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations
can, for example, alter one or more of the biological functions or biochemical characteristics of
the polypeptides of the invention. For example, such alterations may change polypeptide
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover
rate. Further, such alterations can be selected so as to generate polypeptides that are better suited
for expression, scale up and the like in the host cells chosen for expression. For example,
cysteine residues can be deleted or substituted with another amino acid residue in order to
eliminate disulfide bridges.

The terms "purified" or "substantially purified" as used herein denote that the indicated
nucleic acid or polypeptide is present in the substantial absence of other biological
macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more
preferably at least 99% by weight, of the indicated biological macromolecules present (but water,
buffers, and other small molecules, especially molecules having a molecular weight of less than
1000 daltons, can be present).

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from
at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or
polypeptides present in their natural source.

The term "recombinant," when used herein to refer to a polypeptide or protein, means
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian)
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in
bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant microbial"
defines a polypeptide or protein essentially free of native endogenous substances and
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most
bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or
proteins expressed in yeast will have a glycosylation pattern in general different from those
expressed in mammalian cells.

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can
comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements
having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural
or coding sequence which is transcribed into mRNA and translated into protein, and (3)
appropriate transcription initiation and termination sequences. Structural units intended for use
in yeast or eukaryotic expression systems preferably include a leader sequence enabling
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant
protein is expressed without a leader or transport sequence, it may include an amino terminal
methionine residue. This residue may or may not be subsequently cleaved from the expressed
recombinant protein to provide a final product.

The term "recombinant expression system" means host cells which have stably integrated
a recombinant transcriptional unit into chromosomal DNA or carry the recombinant
transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will
express heterologous polypeptides or proteins upon induction of the regulatory elements linked
to the DNA segment or synthetic gene to be expressed. This term also means host cells which
have stably integrated a recombinant genetic element or elements having a regulatory role in
gene expression, for example, promoters or enhancers. Recombinant expression systems as
defined herein will express polypeptides or proteins endogenous to the cell upon induction of the
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells
can be prokaryotic or eukaryotic.

The term "secreted" includes a protein that is transported across or through a membrane,
including transport as a result of signal sequences in its amino acid sequence when it is
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted
wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in which they are
expressed. "Secreted" proteins also include without limitation proteins that are transported
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to
include proteins containing non-typical signal sequences (e.g., Interleukin-1 Beta, see Krasney,
P.A. and Young, P.R. (1992) Cytokine 4(2):134-143) and factors released from damaged cells
(e.g., Interleukin-1 Receptor Antagonist, see Arend, W.P. et al. (1998) Annu. Rev. Immunol.
16:27-55).

Where desired, an expression vector may be designed to contain a "signal or leader
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence
may be naturally present on the polypeptides of the present invention or provided from
heterologous protein sources by recombinant DNA techniques.

The term "stringent" is used to refer to conditions that are commonly understood in the
art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization
to filter-bound DNA in 0.5 M NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at
65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e.,
washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are
described herein in the examples.

In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for
14-base oligonucleotides), 48°C (for 17-base oligonucleotides), 55°C (for 20-base
oligonucleotides), and 60°C (for 23-base oligonucleotides).
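
As a compact restatement of the oligonucleotide wash temperatures listed above, the
following Python sketch stores them as a lookup table. The names OLIGO_WASH_TEMP_C and
wash_temperature_c are illustrative assumptions and are not part of the disclosure. Lengths
other than the four listed are rejected rather than interpolated, since the text gives no rule
for them.

```python
# Exemplary stringent wash temperatures (degrees C) in 6X SSC/0.05% sodium
# pyrophosphate, keyed by oligonucleotide length in bases, as listed in the text.
OLIGO_WASH_TEMP_C = {14: 37, 17: 48, 20: 55, 23: 60}


def wash_temperature_c(oligo_length: int) -> int:
    """Return the listed wash temperature for an oligonucleotide length.

    Only the four lengths named in the text are supported (an assumption of
    this sketch); any other length raises ValueError.
    """
    if oligo_length not in OLIGO_WASH_TEMP_C:
        raise ValueError(
            f"no exemplary wash temperature is listed for {oligo_length}-base oligonucleotides"
        )
    return OLIGO_WASH_TEMP_C[oligo_length]
```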
As used herein, "substantially equivalent" can refer both to nucleotide and amino acid
sequences, for example a mutant sequence, that varies from a reference sequence by one or more
substitutions, deletions, or additions, the net effect of which does not result in an adverse
functional dissimilarity between the reference and subject sequences. Typically, such a
substantially equivalent sequence varies from one of those listed herein by no more than about
35% (i.e., the number of individual residue substitutions, additions, and/or deletions in a
substantially equivalent sequence, as compared to the corresponding reference sequence, divided
by the total number of residues in the substantially equivalent sequence is about 0.35 or less).
Such a sequence is said to have 65% sequence identity to the listed sequence. In one
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a
listed sequence by no more than 30% (70% sequence identity); in a variation of this
embodiment, by no more than 25% (75% sequence identity); and in a further variation of this
embodiment, by no more than 20% (80% sequence identity) and in a further variation of this
embodiment, by no more than 10% (90% sequence identity) and in a further variation of this
embodiment, by no more than 5% (95% sequence identity). Substantially equivalent, e.g.,
mutant, amino acid sequences according to the invention preferably have at least 80% sequence
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent
nucleotide sequences of the invention can have lower percent sequence identities, taking into
account, for example, the redundancy or degeneracy of the genetic code. Preferably, a nucleotide
sequence has at least about 65% identity, more preferably at least about 75% identity, more
preferably at least about 80% sequence identity, more preferably at least about 85% sequence
identity, more preferably at least about 90% sequence identity, more preferably at least about
95% identity, more preferably at least about 98% sequence identity, and most preferably at least
about 99% sequence identity. For the purposes of the present invention, sequences having
substantially equivalent biological activity and substantially equivalent expression characteristics
are considered substantially equivalent. For the purposes of determining equivalence, truncation
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be
disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J.
(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be determined by
other methods known in the art, e.g., by varying hybridization conditions.
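
The ratio defined above (changed residues divided by the total number of residues in the
substantially equivalent sequence) can be illustrated with a short Python sketch. It assumes
the two sequences have already been aligned by some method (for example the Jotun Hein
method named above) and are supplied as equal-length strings with "-" marking gaps. The
function names and the gap character are assumptions made for illustration only.

```python
def variation_fraction(aligned_reference: str, aligned_subject: str, gap: str = "-") -> float:
    """Changed positions divided by the number of residues in the subject sequence.

    A changed position is any aligned column where the two sequences differ:
    a substitution, an addition (gap in the reference) or a deletion (gap in
    the subject). Columns where both strings carry a gap are not counted.
    """
    if len(aligned_reference) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    reference = aligned_reference.upper()
    subject = aligned_subject.upper()
    changes = sum(1 for ref, sub in zip(reference, subject) if ref != sub)
    subject_residues = sum(1 for sub in subject if sub != gap)
    if subject_residues == 0:
        raise ValueError("subject sequence contains no residues")
    return changes / subject_residues


def percent_identity(aligned_reference: str, aligned_subject: str) -> float:
    """Percent identity expressed as 100 * (1 - variation fraction)."""
    return 100.0 * (1.0 - variation_fraction(aligned_reference, aligned_subject))
```

Under this sketch, a subject sequence with a variation fraction of 0.35 or less corresponds to
the 65% identity threshold stated above.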

The term "totipotent" refers to the capability of a cell to differentiate into all of the cell
types of an adult organism.

The term "transformation" means introducing DNA into a suitable host cell so that the
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether
or not any coding sequences are in fact expressed. The term "infection" refers to the introduction
of nucleic acids into a suitable host cell by use of a virus or viral vector.

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified
using known UMFs as a target sequence or target motif with the computer-based systems
described below. The presence and activity of a UMF can be confirmed by attaching the
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated
with an appropriate host under appropriate conditions and the uptake of the marker sequence is
determined. As described above, a UMF will increase the frequency of uptake of a linked
marker sequence.

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 

4.2 NUCLEIC ACIDS OF THE INVENTION

Nucleotide sequences of the invention are set forth in the Sequence Listing. 
The isolated polynucleotides of the invention include a polynucleotide comprising the 
nucleotide sequences of SEQ ID NO:1-1350; a polynucleotide encoding any one of the peptide
sequences of SEQ ID NO:1351-2700; and a polynucleotide comprising the nucleotide sequence
encoding the mature protein coding sequence of the polypeptides of any one of SEQ ID
NO:1351-2700. The polynucleotides of the present invention also include, but are not limited to,
a polynucleotide that hybridizes under stringent conditions to (a) the complement of any of the
nucleotide sequences of SEQ ID NO:1-1350; (b) nucleotide sequences encoding any one of the
amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide which is an allelic
variant of any polynucleotide recited above; (d) a polynucleotide which encodes a species
homolog of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide
comprising a specific domain or truncation of the polypeptides of SEQ ID NO:1351-2700.
Domains of interest may depend on the nature of the encoded polypeptide; e.g., domains in
receptor-like polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic
domains, or combinations thereof; domains in immunoglobulin-like proteins include the variable
immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and
substrate binding domains; and domains in ligand polypeptides include receptor-binding
domains.

The polynucleotides of the invention include naturally occurring or wholly or partially
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides
may include all of the coding region of the cDNA or may represent a portion of the coding
region of the cDNA.

The present invention also provides genes corresponding to the cDNA sequences disclosed
herein. The corresponding genes can be isolated in accordance with known methods using the
sequence information disclosed herein. Such methods include the preparation of probes or primers
from the disclosed sequence information for identification and/or amplification of genes in
appropriate genomic libraries or other sources of genomic materials. Further 5' and 3' sequence can
be obtained using methods known in the art. For example, full length cDNA or genomic DNA that
corresponds to any of the polynucleotides of SEQ ID NO:1-1350 can be obtained by screening
appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of
the polynucleotides of SEQ ID NO:1-1350 or a portion thereof as a probe. Alternatively, the
polynucleotides of SEQ ID NO:1-1350 may be used as the basis for suitable primer(s) that allow
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries.
The nucleic acid sequences of the invention can be assembled from ESTs and sequences
(including cDNA and genomic sequences) obtained from one or more public databases, such as
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information,
representative fragment or segment information, or novel segment information for the full-length
gene.

The polynucleotides of the invention also provide polynucleotides including nucleotide
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about
75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%,
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at
least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above.

Included within the scope of the nucleic acid sequences of the invention are nucleic acid
sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences
of SEQ ID NO:1-1350, or complements thereof, which fragment is greater than about 5
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most
preferably greater than 17 nucleotides. Fragments of, e.g., 15, 17, or 20 nucleotides or more that
are selective for (i.e., specifically hybridize to) any one of the polynucleotides of the invention
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in
the same family of genes or can differentiate human genes from genes of other species, and are
preferably based on unique nucleotide sequences.
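
One simple way to look for candidate probe regions "based on unique nucleotide sequences"
is to list the fixed-length fragments of a target sequence that do not occur, on either strand,
in related sequences. The Python sketch below does this by exact string matching. This is an
assumption made for illustration: exact-match uniqueness is only a crude proxy for specific
hybridization under stringent conditions, and the function names and the default fragment
length of 17 are hypothetical choices rather than a method taken from the disclosure.

```python
from typing import List, Sequence

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def unique_fragments(target: str, others: Sequence[str], k: int = 17) -> List[str]:
    """List k-base fragments of `target` absent from both strands of every other sequence."""
    if k <= 0:
        raise ValueError("fragment length must be positive")
    background = set()
    for seq in others:
        for strand in (seq.upper(), reverse_complement(seq)):
            for i in range(len(strand) - k + 1):
                background.add(strand[i:i + k])
    target = target.upper()
    seen = set()
    fragments = []
    for i in range(len(target) - k + 1):
        fragment = target[i:i + k]
        if fragment not in background and fragment not in seen:
            seen.add(fragment)
            fragments.append(fragment)  # keep order of first occurrence
    return fragments
```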

The sequences falling within the scope of the present invention are not limited to these
specific sequences, but also include allelic and species variations thereof. Allelic and species
variations can be routinely determined by comparing the sequence provided in SEQ ID NO:1-1350, a
representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95%
identical, to SEQ ID NO:1-1350 with a sequence from another isolate of the same species.
Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules
coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words,
in the coding region of an ORF, substitution of one codon for another codon that encodes the same
amino acid is expressly contemplated.
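
The codon-substitution point above can be checked mechanically: two ORFs that differ only
by synonymous codons translate to the same amino acid sequence. The following Python
sketch builds the standard genetic code table and compares translations. It assumes the
standard code and in-frame DNA input; the function names and the example sequences in the
usage note are illustrative and are not sequences of the invention.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varying fastest).
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(orf: str) -> str:
    """Translate an in-frame DNA ORF to one-letter amino acids ('*' marks a stop codon)."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))


def encode_same_protein(orf_a: str, orf_b: str) -> bool:
    """True when both ORFs encode the identical amino acid sequence."""
    return translate(orf_a) == translate(orf_b)
```

For instance, ATGGCTTTA and ATGGCCCTG differ at two codons, yet both encode
Met-Ala-Leu, so encode_same_protein returns True for that pair.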

The nearest neighbor or homology result for the nucleic acids of the present invention,
including SEQ ID NO:1-1350, can be obtained by searching a database using an algorithm or a
program. Preferably, BLAST, which stands for Basic Local Alignment Search Tool, is used to
search for local sequence alignments (Altschul, S.F. J. Mol. Evol. 36:290-300 (1993) and Altschul,
S.F. et al. J. Mol. Biol. 215:403-410 (1990)). Alternatively, a FASTA version 3 search against
Genpept, using the Fastxy algorithm, can be performed.

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also
provided by the present invention. Species homologs may be isolated and identified by making
suitable probes or primers from the sequences provided herein and screening a suitable nucleic
acid source from the desired species.

The invention also encompasses allelic variants of the disclosed polynucleotides or
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which also
encode proteins which are identical, homologous or related to that encoded by the
polynucleotides.

The nucleic acid sequences of the invention are further directed to sequences which
encode variants of the described nucleic acids. These amino acid sequence variants may be
prepared by methods known in the art by introducing appropriate nucleotide changes into a
native or variant polynucleotide. There are two variables in the construction of amino acid
sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids
encoding the amino acid sequence variants are preferably constructed by mutating the
polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic
acid alterations can be made at sites that differ in the nucleic acids from different species
(variable positions) or in highly conserved regions (constant regions). Sites at such locations
will typically be modified in series, e.g., by substituting first with conservative choices (e.g.,
hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant
choices (e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions
may be made at the target site. Amino acid sequence deletions generally range from about 1 to
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid
insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid
residues. Intrasequence insertions may range generally from about 1 to 10 amino acid residues,
preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal
sequences necessary for secretion or for intracellular targeting in different host cells and
sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein.
In a preferred method, polynucleotides encoding the novel amino acid sequences are
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent
nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the
site being changed. In general, the techniques of site-directed mutagenesis are well known to
those of skill in the art and this technique is exemplified by publications such as Edelman et al.,
DNA 2:183 (1983). A versatile and efficient method for producing site-specific changes in a
polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500
(1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids.
When small amounts of template DNA are used as starting material, primer(s) that differ
slightly in sequence from the corresponding region in the template DNA can generate the desired
amino acid variant. PCR amplification results in a population of product DNA fragments that
differ from the polynucleotide template encoding the polypeptide at the position specified by the
primer. The product DNA fragments replace the corresponding region in the plasmid and this
gives a polynucleotide encoding the desired amino acid variant.
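
The structure of a mutagenic oligonucleotide described above (the altered codon flanked on
both sides by unchanged template nucleotides) can be sketched in Python as follows. The
function name, the zero-based codon index, and the default flank of 12 nucleotides per side
are assumptions chosen for illustration. The text does not specify a flank length, and a real
design would also weigh duplex stability, which this sketch does not model.

```python
def mutagenic_oligo(template: str, codon_index: int, new_codon: str, flank: int = 12) -> str:
    """Return a sense-strand oligonucleotide carrying `new_codon` in place of one codon.

    `template` is the coding strand read in frame from its first base, and
    `codon_index` is zero-based. `flank` unchanged template nucleotides are
    kept on each side of the replaced codon.
    """
    template = template.upper()
    new_codon = new_codon.upper()
    if len(new_codon) != 3:
        raise ValueError("new_codon must be exactly three nucleotides")
    start = 3 * codon_index
    end = start + 3
    if codon_index < 0 or start - flank < 0 or end + flank > len(template):
        raise ValueError("template too short for the requested flanking sequence")
    return template[start - flank:start] + new_codon + template[end:end + flank]
```

For example, replacing the fourth codon (TTT) of AAACCCGGGTTTAAACCCGGG with
TGT, using 6 flanking nucleotides per side, gives CCCGGGTGTAAACCC.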

A further technique for generating amino acid variants is the cassette mutagenesis
technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current
Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic
code, other DNA sequences which encode substantially the same or a functionally equivalent
amino acid sequence may be used in the practice of the invention for the cloning and expression
of these novel nucleic acids. Such DNA sequences include those which are capable of
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions.

Polynucleotides encoding preferred polypeptide truncations of the invention can be used
to generate polynucleotides encoding chimeric or fusion proteins comprising one or more
domains of the invention and heterologous protein sequences.

The polynucleotides of the invention additionally include the complement of any of the
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or
synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known
to those of skill in the art and can include, for example, methods for determining hybridization
conditions that can routinely isolate polynucleotides of the desired sequence identities.

In accordance with the invention, polynucleotide sequences comprising the mature
protein coding sequences corresponding to any one of SEQ ID NO:1-1350, or functional
equivalents thereof, may be used to generate recombinant DNA molecules that direct the
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also
included are the cDNA inserts of any of the clones identified herein.

A polynucleotide according to the invention can be joined to any of a variety of other
nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al.
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g.,
plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the
art. Accordingly, the invention also provides a vector including a polynucleotide of the
invention and a host cell containing the polynucleotide. In general, the vector contains an origin
of replication functional in at least one organism, convenient restriction endonuclease sites, and a
selectable marker for the host cell. Vectors according to the invention include expression
vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell
according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular
organism or part of a multicellular organism.

The present invention further provides recombinant constructs comprising a nucleic acid
having any of the nucleotide sequences of SEQ ID NO:1-1350 or a fragment thereof or any other
polynucleotides of the invention. In one embodiment, the recombinant constructs of the present
invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid having
any of the nucleotide sequences of SEQ ID NO:1-1350 or a fragment thereof is inserted, in a
forward or reverse orientation. In the case of a vector comprising one of the ORFs of the present
invention, the vector may further comprise regulatory sequences, including for example, a
promoter, operably linked to the ORF. Large numbers of suitable vectors and promoters are
known to those of skill in the art and are commercially available for generating the recombinant
constructs of the present invention. The following vectors are provided by way of example.
Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a,
pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia).
Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL
(Pharmacia).

The isolated polynucleotide of the invention may be operably linked to an expression
control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al.,
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many
suitable expression control sequences are known in the art. General methods of expressing
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in
Enzymology 185, 537-566 (1990). As defined herein "operably linked" means that the isolated
polynucleotide of the invention and an expression control sequence are situated within a vector
or cell in such a way that the protein is expressed by a host cell which has been transformed
(transfected) with the ligated polynucleotide/expression control sequence.

Promoter regions can be selected from any desired gene using CAT (chloramphenicol
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are
pKK232-8 and pCM7. Particular named bacterial promoters include lacI, lacZ, T3, T7, gpt,
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine
kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of
the appropriate vector and promoter is well within the level of ordinary skill in the art.
Generally, recombinant expression vectors will include origins of replication and selectable
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli
and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct
transcription of a downstream structural sequence. Such promoters can be derived from operons
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), α-factor, acid
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is
assembled in appropriate phase with translation initiation and termination sequences, and
preferably, a leader sequence capable of directing secretion of translated protein into the
periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a
fusion protein including an amino terminal identification peptide imparting desired
characteristics, e.g., stabilization or simplified purification of expressed recombinant product.
Useful expression vectors for bacterial use are constructed by inserting a structural DNA
sequence encoding a desired protein together with suitable translation initiation and termination
signals in operable reading phase with a functional promoter. The vector will comprise one or
more phenotypic selectable markers and an origin of replication to ensure maintenance of the
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species
within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be
employed as a matter of choice.

As a representative but non-limiting example, useful expression vectors for bacterial use
can comprise a selectable marker and bacterial origin of replication derived from commercially
available plasmids comprising genetic elements of the well known cloning vector pBR322
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These
pBR322 "backbone" sections are combined with an appropriate promoter and the structural
sequence to be expressed. Following transformation of a suitable host strain and growth of the
host strain to an appropriate cell density, the selected promoter is induced or derepressed by
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an
additional period. Cells are typically harvested by centrifugation, disrupted by physical or
chemical means, and the resulting crude extract retained for further purification.

Polynucleotides of the invention can also be used to induce immune responses. For
example, as described in Fan et al., Nat. Biotech. 17:870-872 (1999), incorporated herein by
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies
against the encoded polypeptide following topical administration of naked plasmid DNA or
following injection, and preferably intramuscular injection of the DNA. The nucleic acid
sequences are preferably inserted in a recombinant expression vector and may be in the form of
naked DNA.



4.3 ANTISENSE

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide
sequence of SEQ ID NO:1-1350, or fragments, analogs or derivatives thereof. An "antisense"
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid
molecules are provided that comprise a sequence complementary to at least about 10, 25, 50,
100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID
NO:1351-2700 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ ID
NO:1-1350 are additionally provided.

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region"
of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers
to the region of the nucleotide sequence comprising codons which are translated into amino acid
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term
"noncoding region" refers to 5' and 3' sequences which flank the coding region that are not
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions).

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., SEQ
ID NO:1-1350), antisense nucleic acids of the invention can be designed according to the rules
of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be
complementary to the entire coding region of an mRNA, but more preferably is an oligonucleotide
that is antisense to only a portion of the coding or noncoding region of an mRNA. For example,
the antisense oligonucleotide can be complementary to the region surrounding the translation
start site of an mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25,
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in
the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be
chemically synthesized using naturally occurring nucleotides or variously modified nucleotides
designed to increase the biological stability of the molecules or to increase the physical stability
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate
derivatives and acridine substituted nucleotides can be used.
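
Designing an antisense oligonucleotide by Watson-Crick pairing, as described above, amounts
to taking the reverse complement of the chosen sense-strand window. The Python sketch
below builds such an oligonucleotide for a window roughly centered on a translation start
codon. The function name, the zero-based start_site argument, and the centering rule are
assumptions made for illustration. The caller must supply the start codon position, since the
sketch does not try to locate it, and Hoogsteen pairing and modified nucleotides are outside
its scope.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(coding_strand: str, start_site: int, length: int = 20) -> str:
    """Return a DNA oligonucleotide (5' to 3') antisense to a window around the start codon.

    `coding_strand` is the sense sequence (DNA letters, or RNA with U) and
    `start_site` is the zero-based index of the first base of the start codon.
    The window places about half of the remaining bases upstream of the codon.
    """
    sense = coding_strand.upper().replace("U", "T")
    if length < 3:
        raise ValueError("length must cover at least the start codon")
    upstream = (length - 3) // 2
    begin = max(0, start_site - upstream)
    end = begin + length
    if start_site < 0 or start_site + 3 > len(sense) or end > len(sense):
        raise ValueError("window does not fit within the coding strand")
    window = sense[begin:end]
    # Watson-Crick antisense strand: complement each base, then reverse.
    return window.translate(_COMPLEMENT)[::-1]
```

With the hypothetical sense sequence GGCACCATGGCTAGCTTAGGC (start codon at index 6)
and a length of 9, the window is ACCATGGCT and the antisense oligonucleotide is AGCCATGGT.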

Examples of modified nucleotides that can be used to generate the antisense nucleic acid
include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine,
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine,
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine,
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine,
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil,
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil,
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by
conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g.,
by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the
control of a strong pol II or pol III promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention is an
α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms specific
double-stranded hybrids with complementary RNA in which, contrary to the usual β-units, the
strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15: 6625-6641). The
antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al.
(1987) Nucleic Acids Res 15: 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987)
FEBS Lett 215: 327-330).

4.4 RIBOZYMES AND PNA MOIETIES

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme.
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a
single-stranded nucleic acid, such as an mRNA, to which they have a complementary region.
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988)
Nature 334:585-591)) can be used to catalytically cleave mRNA transcripts to thereby inhibit
translation of an mRNA. A ribozyme having specificity for a nucleic acid of the invention can be
designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO:1-
1350). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which
the nucleotide sequence of the active site is complementary to the nucleotide sequence to be
cleaved in a SECX-encoding mRNA. See, e.g., Cech et al. U.S. Pat. No. 4,987,071; and Cech et
al. U.S. Pat. No. 5,116,742. Alternatively, SECX mRNA can be used to select a catalytic RNA
having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al.,
(1993) Science 261:1411-1418.

Alternatively, gene expression can be inhibited by targeting nucleotide sequences
complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical
structures that prevent transcription of the gene in target cells. See generally, Helene (1991)
Anticancer Drug Des. 6: 569-84; Helene et al. (1992) Ann. N.Y. Acad. Sci. 660:27-36; and
Maher (1992) Bioessays 14: 807-15.

In various embodiments, the nucleic acids of the invention can be modified at the base
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic
acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med
Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid
mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above;
Perry-O'Keefe et al. (1996) PNAS 93: 14670-675.

PNAs of the invention can be used in therapeutic and diagnostic applications. For
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of
gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication.
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a
gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in
combination with other enzymes, e.g., S1 nucleases (Hyrup B. (1996) above); or as probes or
primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996),
above).

In another embodiment, PNAs of the invention can be modified, e.g., to enhance their
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug
delivery known in the art. For example, PNA-DNA chimeras can be generated that may
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked
using linkers of appropriate lengths selected in terms of base stacking, number of bonds between
the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras
can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24:
3357-63. For example, a DNA chain can be synthesized on a solid support using standard
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g.,
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA
and the 5' end of DNA (Mag et al. (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3'
DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can be synthesized
with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem
Lett 5: 1119-11124.

In other embodiments, the oligonucleotide may include other appended groups such as
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the
cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556;
Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. WO88/09810) or
the blood-brain barrier (see, e.g., PCT Publication No. WO89/10134). In addition,
oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g., Krol et
al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res.
5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered
cleavage agent, etc.

4.5 HOSTS

The present invention further provides host cells genetically engineered to contain the
polynucleotides of the invention. For example, such host cells may contain nucleic acids of the
invention introduced into the host cell using known transformation, transfection or infection
methods. The present invention still further provides host cells genetically engineered to express
the polynucleotides of the invention, wherein such polynucleotides are in operative association
with a regulatory sequence heterologous to the host cell which drives expression of the
polynucleotides in the cell.

Knowledge of nucleic acid sequences allows for modification of cells to permit, or
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous
recombination) to provide increased polypeptide expression by replacing, in whole or in part, the
naturally occurring promoter with all or part of a heterologous promoter so that the cells express
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it
is operatively linked to the encoding sequences. See, for example, PCT International Publication
No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding
sequence, amplification of the marker DNA by standard selection methods results in
co-amplification of the desired protein coding sequences in the cells.

The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower
eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by
calcium phosphate transfection, DEAE-dextran mediated transfection, or electroporation (Davis,
L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the
polynucleotides of the invention can be used in conventional manners to produce the gene
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a
heterologous protein under the control of the EMF.

Any host/vector system can be used to express one or more of the ORFs of the present
invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, CV-1 cells,
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic hosts such as E. coli and B. subtilis.
The most preferred cells are those which do not normally express the particular polypeptide or
protein or which express the polypeptide or protein at a low natural level. Mature proteins can
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate
promoters. Cell-free translation systems can also be employed to produce such proteins using
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and
expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New
York (1989), the disclosure of which is hereby incorporated by reference.

Various mammalian cell culture systems can also be employed to express recombinant
protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney
fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary
(CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3
cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived
from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK,
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of
replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation
site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example,
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide
the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein
refolding steps can be used, as necessary, in completing configuration of the mature protein.
Finally, high performance liquid chromatography (HPLC) can be employed for final purification
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing
agents.

Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast
or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial
strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it
may be necessary to modify the protein produced therein, for example by phosphorylation or
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent
attachments may be accomplished using known chemical or enzymatic methods.

In another embodiment of the present invention, cells and tissues may be engineered to
express an endogenous gene comprising the polynucleotides of the invention under the control of
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene
may be replaced by homologous recombination. As described herein, gene targeting can be used
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions,
negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or
combinations of said sequences. Alternatively, sequences which affect the structure or stability
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by
targeting. These sequences include polyadenylation signals, mRNA stability elements, splice
sites, leader sequences for enhancing or modifying transport or secretion properties of the
protein, or other sequences which alter or improve the function or stability of protein or RNA
molecules.

The targeting event may be a simple insertion of the regulatory sequence, placing the
gene under the control of the new regulatory sequence, e.g., inserting a new promoter or
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element.
Alternatively, the targeting event may replace an existing element; for example, a tissue-specific
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new
sequences are added. In all cases, the identification of the targeting event may be facilitated by
the use of one or more selectable marker genes that are contiguous with the targeting DNA,
allowing for the selection of cells in which the exogenous DNA has integrated into the host cell
genome. The identification of the targeting event may also be facilitated by the use of one or
more marker genes exhibiting the property of negative selection, such that the negatively
selectable marker is linked to the exogenous DNA but configured such that the negatively
selectable marker flanks the targeting sequence, and such that a correct homologous
recombination event with sequences in the host cell genome does not result in the stable
integration of the negatively selectable marker. Markers useful for this purpose include the
Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine
phosphoribosyl-transferase (gpt) gene.

The gene targeting or gene activation techniques which can be used in accordance with
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to
Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No.
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No.
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference
herein in its entirety.

4.6 POLYPEPTIDES OF THE INVENTION

The isolated polypeptides of the invention include, but are not limited to, a polypeptide
comprising: the amino acid sequences set forth as any one of SEQ ID NO:1351-2700 or an
amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO:1-1350 or the
corresponding full length or mature protein. Polypeptides of the invention also include
polypeptides preferably with biological or immunological activity that are encoded by: (a) a
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO:1-1350 or (b)
polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NO:1351-
2700 or (c) polynucleotides that hybridize to the complement of the polynucleotides of either (a)
or (b) under stringent hybridization conditions. The invention also provides biologically active
or immunologically active variants of any of the amino acid sequences set forth as SEQ ID
NO:1351-2700 or the corresponding full length or mature protein; and "substantial equivalents"
thereof (e.g., with at least about 65%, at least about 70%, at least about 75%, at least about 80%,
at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 92%, 93%, 94%, typically at
least about 95%, 96%, 97%, more typically at least about 98%, or most typically at least about
99% amino acid identity) that retain biological activity. Polypeptides encoded by allelic variants
may have a similar, increased, or decreased activity compared to polypeptides comprising SEQ
ID NO:1351-2700.
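
To make the percent-identity thresholds above concrete, the following minimal sketch computes amino acid identity for two sequences that have already been aligned (gaps written as "-"). The choice of denominator (all aligned columns, including gapped ones) is an assumption; alignment programs differ on this point, and the disclosure does not fix a formula here. The example peptides are made up.

```python
# Hypothetical sketch: percent identity over a pre-computed pairwise alignment.
# Assumption: identity = identical residues / aligned columns (gaps counted).


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return percent identity for two equal-length aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # column with no residue in either sequence
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def is_substantial_equivalent(aligned_a: str, aligned_b: str,
                              threshold: float = 95.0) -> bool:
    """Check an alignment against an identity threshold (e.g., 65 to 99)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


identity = percent_identity("MKTAYIAKQR", "MKTAYLAKQR")  # one mismatch in ten
gapped = percent_identity("MKT-YIAK", "MKTAYIAK")        # one gap in eight
print(identity, gapped)
```

In practice the alignment itself would come from one of the programs named in Section 4.6.1; this sketch only covers the final counting step.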

Fragments of the proteins of the present invention which are capable of exhibiting
biological activity are also encompassed by the present invention. Fragments of the protein may
be in linear form or they may be cyclized using known methods, for example, as described in H.
U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer.
Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated herein by reference. Such
fragments may be fused to carrier molecules such as immunoglobulins for many purposes,
including increasing the valency of protein binding sites.

The present invention also provides both full-length and mature forms (for example,
without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding
sequence is identified in the sequence listing by translation of the disclosed nucleotide
sequences. The mature form of such protein may be obtained by expression of a full-length
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form
of the protein is also determinable from the amino acid sequence of the full-length form. Where
proteins of the present invention are membrane bound, soluble forms of the proteins are also
provided. In such forms, part or all of the regions causing the proteins to be membrane bound
are deleted so that the proteins are fully secreted from the cell in which they are expressed.

Protein compositions of the present invention may further comprise an acceptable carrier,
such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic acid
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the
present invention. By "degenerate variant" is intended nucleotide fragments which differ from a
nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic
acid fragments of the present invention are the ORFs that encode proteins.
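
The definition of a "degenerate variant" given above can be illustrated with a short sketch that translates two different nucleotide sequences with the standard genetic code and checks that they encode the same polypeptide. The two example ORFs and the function names are illustrative assumptions, not sequences from the sequence listing.

```python
# Hypothetical sketch: test whether two ORFs are degenerate variants,
# i.e. differ in nucleotide sequence but encode an identical polypeptide.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_degenerate_variant(orf_a: str, orf_b: str) -> bool:
    """True if the sequences differ but encode the same polypeptide."""
    return orf_a.upper() != orf_b.upper() and translate(orf_a) == translate(orf_b)


orf = "ATGGCTTCCAAGTGA"      # made-up ORF
variant = "ATGGCCAGCAAATGA"  # synonymous codons at three positions
print(translate(orf), translate(variant), is_degenerate_variant(orf, variant))
```
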

A variety of methodologies known in the art can be utilized to obtain any one of the
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid
sequence can be synthesized using commercially available peptide synthesizers. The
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary
structural and/or conformational characteristics with proteins may possess biological properties
in common therewith, including protein activity. This technique is particularly useful in
producing small peptides and fragments of larger polypeptides. Fragments are useful, for
example, in generating antibodies against the native polypeptide. Thus, they may be employed
as biologically active or immunological substitutes for natural, purified proteins in screening of
therapeutic compounds and in immunological processes for the development of antibodies.

The polypeptides and proteins of the present invention can alternatively be purified from
cells which have been altered to express the desired polypeptide or protein. As used herein, a
cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic
manipulation, is made to produce a polypeptide or protein which it normally does not produce or
which the cell normally produces at a lower level. One skilled in the art can readily adapt
procedures for introducing and expressing either recombinant or synthetic sequences into
eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides
or proteins of the present invention.

The invention also relates to methods for producing a polypeptide comprising growing a
culture of host cells of the invention in a suitable culture medium, and purifying the protein from
the cells or the culture in which the cells are grown. For example, the methods of the invention
include a process for producing a polypeptide in which a host cell containing a suitable
expression vector that includes a polynucleotide of the invention is cultured under conditions that
allow expression of the encoded polypeptide. The polypeptide can be recovered from the
culture, conveniently from the culture medium, or from a lysate prepared from the host cells and
further purified. Preferred embodiments include those in which the protein produced by such
process is a full length or mature form of the protein.

In an alternative method, the polypeptide or protein is purified from bacterial cells which
naturally produce the polypeptide or protein. One skilled in the art can readily follow known
methods for isolating polypeptides and proteins in order to obtain one of the isolated
polypeptides or proteins of the present invention. These include, but are not limited to,
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography,
and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and
Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory
Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that
retain biological/immunological activity include fragments comprising greater than about 100
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein
domains.

The purified polypeptides can be used in in vitro binding assays which are well known in
the art to identify molecules which bind to the polypeptides. These molecules include, but are not
limited to, e.g., small molecules, molecules from combinatorial libraries, antibodies or other
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist
activity in in vivo tissue culture or animal models that are well known in the art. In brief, the
molecules are titrated into a plurality of cell cultures or animals and then tested for either
cell/animal death or prolonged survival of the animal/cells.

In addition, the peptides of the invention or molecules capable of binding to the peptides 
may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the
specificity of the binding molecule for SEQ ID NO:1351-2700.

The protein of the invention may also be expressed as a product of transgenic animals,
e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized
by somatic or germ cells containing a nucleotide sequence encoding the protein.

The proteins provided herein also include proteins characterized by amino acid sequences
similar to those of purified proteins but into which modifications are naturally provided or
deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be
made by those skilled in the art using known techniques. Modifications of interest in the protein
sequences may include the alteration, substitution, replacement, insertion or deletion of a
selected amino acid residue in the coding sequence. For example, one or more of the cysteine
residues may be deleted or replaced with another amino acid to alter the conformation of the
molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are
well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such
alteration, substitution, replacement, insertion or deletion retains the desired activity of the
protein. Regions of the protein that are important for the protein function can be determined by
various methods known in the art including the alanine-scanning method, which involves
systematic substitution of single or strings of amino acids with alanine, followed by testing the
resulting alanine-containing variant for biological activity. This type of analysis determines the
importance of the substituted amino acid(s) in biological activity. Regions of the protein that are
important for protein function may be determined by the eMATRIX program.
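
The alanine-scanning method described above can be sketched as a simple enumeration of variants in which single residues, or strings of residues, are replaced with alanine. The example peptide, the labelling scheme and the function name are assumptions made for illustration; the activity testing of each variant is an experimental step that is not modelled here.

```python
# Hypothetical sketch: enumerate alanine-scanning variants of a protein.
# Each variant replaces a window of residues with alanine ("A").


def alanine_scan(protein: str, window: int = 1) -> list[tuple[str, str]]:
    """Return (label, variant sequence) pairs for every window position."""
    protein = protein.upper()
    variants = []
    for i in range(len(protein) - window + 1):
        segment = protein[i:i + window]
        if set(segment) == {"A"}:
            continue  # window is already all alanine; nothing to substitute
        mutant = protein[:i] + "A" * window + protein[i + window:]
        if window == 1:
            label = f"{segment}{i + 1}A"           # e.g. K2A
        else:
            label = f"{i + 1}-{i + window}>Ala"    # e.g. 2-3>Ala
        variants.append((label, mutant))
    return variants


single = alanine_scan("MKCAL")           # made-up peptide
pairs = alanine_scan("MKCAL", window=2)  # strings of two residues
for label, seq in single:
    print(label, seq)
```

Each variant would then be expressed and assayed, and a loss of activity would point to the substituted residue(s) as functionally important.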

Other fragments and derivatives of the sequences of proteins which would be expected to
retain protein activity in whole or in part and are useful for screening or other immunological
methodologies may also be easily made by those skilled in the art given the disclosures herein.
Such modifications are encompassed by the present invention.

The protein may also be produced by operably linking the isolated polynucleotide of the
invention to suitable control sequences in one or more insect expression vectors, and employing
an insect expression system. Materials and methods for baculovirus/insect cell expression
systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif., U.S.A.
(the MaxBac™ kit), and such methods are well known in the art, as described in Summers and
Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present
invention is "transformed."

The protein of the invention may be prepared by culturing transformed host cells under
culture conditions suitable to express the recombinant protein. The resulting expressed protein
may then be purified from such culture (i.e., from culture medium or cell extracts) using known
purification processes, such as gel filtration and ion exchange chromatography. The purification
of the protein may also include an affinity column containing agents which will bind to the
protein; one or more column steps over such affinity resins as concanavalin A-agarose,
heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl
ether; or immunoaffinity chromatography.

Alternatively, the protein of the invention may also be expressed in a form which will
facilitate purification. For example, it may be expressed as a fusion protein, such as those of
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a
His tag. Kits for expression and purification of such fusion proteins are commercially available
from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen,
respectively. The protein can also be tagged with an epitope and subsequently purified by using
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially
available from Kodak (New Haven, Conn.).

Finally, one or more reverse-phase high performance liquid chromatography (RP-HPLC)
steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other
aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing
purification steps, in various combinations, can also be employed to provide a substantially
homogeneous isolated recombinant protein. The protein thus purified is substantially free of
other mammalian proteins and is defined in accordance with the present invention as an "isolated
protein."

The polypeptides of the invention include analogs (variants). This embraces fragments,
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted.
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or
modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to
another moiety or moieties, e.g., targeting moiety or another therapeutic agent. Such analogs
may exhibit improved properties such as activity and/or stability. Examples of moieties which
may be fused to the polypeptide or an analog include, for example, targeting moieties which
provide for the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells,
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well
as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be
fused to the polypeptide include therapeutic agents which are used for treatment, for example,
immunosuppressive drugs such as cyclosporin, FK506, azathioprine, CD3 antibodies and
steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as
alpha or beta interferon.

4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY
AND SIMILARITY

Preferred methods to determine identity and/or similarity are designed to give the largest
match between the sequences tested. Methods to determine identity and similarity are codified
in computer programs including, but not limited to, the GCG program package, including GAP
(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group,
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F.
et al., J. Molec. Biol. 215:403-410 (1990)), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res.
vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp.
Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software
(Nevill-Manning et al., ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam
software (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein
incorporated by reference) and the Kyte-Doolittle hydrophobicity prediction algorithm (J. Mol.
Biol., 157, pp. 105-31 (1982), incorporated herein by reference). The BLAST programs are
publicly available from the National Center for Biotechnology Information (NCBI) and other
sources (BLAST Manual, Altschul, S., et al. NCBI NLM NIH Bethesda, MD 20894; Altschul, S.,
et al., J. Mol. Biol. 215:403-410 (1990)).
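
As a rough illustration of the kind of quantity the programs cited above report, the following
Python sketch computes percent identity from two already-aligned sequences and a sliding-window
Kyte-Doolittle hydropathy profile. It is not the algorithm of GAP, BLAST or FASTA. The function
names, the gap character "-", the choice of denominator (aligned columns, ignoring columns that
are gaps in both rows) and the default window of 9 residues are assumptions made for this
example only.

```python
# Illustrative sketch only; not the algorithm of GAP, BLAST or FASTA.

# Kyte-Doolittle hydropathy values (J. Mol. Biol. 157:105-132, 1982).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def percent_identity(aln_a, aln_b, gap="-"):
    """Percent of aligned columns that hold identical residues.

    Both inputs must be rows of the same pairwise alignment (equal length).
    Columns that are gaps in both rows are ignored (an assumed convention).
    """
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aln_a.upper(), aln_b.upper())
               if not (a == gap and b == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def hydropathy_profile(protein, window=9):
    """Mean Kyte-Doolittle score for every full window along the protein."""
    scores = [KD[res] for res in protein.upper()]
    if window < 1 or window > len(scores):
        return []
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]
```

For instance, percent_identity("ACGT-A", "ACCTTA") counts four matching columns out of six.
Other conventions for the denominator (for example, the length of the shorter sequence) give
different values, so any reported identity should state which convention was used.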

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric
protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to
another polypeptide. Within a fusion protein the polypeptide according to the invention can
correspond to all or a portion of a protein according to the invention. In one embodiment, a
fusion protein comprises at least one biologically active portion of a protein according to the
invention. In another embodiment, a fusion protein comprises at least two biologically active
portions of a protein according to the invention. Within the fusion protein, the term "operatively
linked" is intended to indicate that the polypeptide according to the invention and the other
polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or
C-terminus.

For example, in one embodiment a fusion protein comprises a polypeptide according to
the invention operably linked to the extracellular domain of a second protein.

In another embodiment, the fusion protein is a GST-fusion protein in which the
polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione
S-transferase) sequences.

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which
the polypeptide sequences according to the invention, comprising one or more domains, are fused
to sequences derived from a member of the immunoglobulin protein family. The
immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical
compositions and administered to a subject to inhibit an interaction between a ligand and a
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo.
The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand.
Inhibition of the ligand/protein interaction may be useful therapeutically both for the treatment of
proliferative and differentiative disorders, e.g., cancer, and for modulating (e.g., promoting or
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be
used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays
to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand.

A chimeric or fusion protein of the invention can be produced by standard recombinant
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences
are ligated together in-frame in accordance with conventional techniques, e.g., by employing
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can
be synthesized by conventional techniques including automated DNA synthesizers.
Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that
give rise to complementary overhangs between two consecutive gene fragments that can
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for
example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley &
Sons, 1992). Moreover, many expression vectors are commercially available that already encode
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the
invention can be cloned into such an expression vector such that the fusion moiety is linked
in-frame to the protein of the invention.
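
The requirement that the two coding sequences be joined in-frame can be illustrated with a short
Python sketch. It assumes both inputs are complete open reading frames written 5' to 3' as DNA,
removes a terminal stop codon from the upstream partner, and refuses to join segments whose
lengths are not multiples of three. The function names and the optional linker argument are
hypothetical and are not taken from any vector system or kit named above.

```python
# Illustrative sketch only: checks that two open reading frames are joined in-frame.

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Standard genetic code, with codons enumerated in TCAG order.
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
STOPS = {"TAA", "TAG", "TGA"}


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def fuse_in_frame(upstream_cds, downstream_cds, linker=""):
    """Join two ORFs so the downstream partner stays in the upstream reading frame."""
    up = upstream_cds.upper()
    down = downstream_cds.upper()
    link = linker.upper()
    if up[-3:] in STOPS:
        up = up[:-3]  # drop the upstream stop codon so translation reads through
    for name, seq in (("upstream", up), ("linker", link), ("downstream", down)):
        if len(seq) % 3 != 0:
            raise ValueError(f"{name} length is not a multiple of 3; "
                             "the fusion would shift the reading frame")
    return up + link + down
```

With the toy inputs "ATGGCTTAA" and "ATGAAATAA", the fused sequence translates as a single
polypeptide (MAMK); an upstream segment of five bases is rejected because it would shift the
downstream frame.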

4.8 GENE THERAPY 

Mutations in the polynucleotides of the invention may result in loss of normal
function of the encoded protein. The invention thus provides gene therapy to restore normal
activity of the polypeptides of the invention, or to treat disease states involving polypeptides of
the invention. Delivery of a functional gene encoding polypeptides of the invention to
appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example,
Anderson, Nature, supplement to vol. 392, no. 6679, pp. 25-30 (1998). For additional reviews of
gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of
the nucleotides of the present invention or a gene encoding the polypeptides of the present
invention can also be accomplished with extrachromosomal substrates (transient expression) or
artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of
proteins of the present invention in order to proliferate or to produce a desired effect on or
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes.
Alternatively, it is contemplated that in other human disease states, preventing the expression of
or inhibiting the activity of polypeptides of the invention will be useful in treating the disease
states. It is contemplated that antisense therapy or gene therapy could be applied to negatively
regulate the expression of polypeptides of the invention.

Other methods inhibiting expression of a protein include the introduction of antisense
molecules to the nucleic acids of the present invention, their complements, or their translated RNA
sequences, by methods known in the art. Further, the polypeptides of the present invention can be
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such
as a silencer, which is tissue specific.
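
The base-pairing relationship between an antisense molecule and its target can be sketched in
Python as a reverse complement. This is a minimal illustration that assumes the target is written
5' to 3' using only the unambiguous bases A, C, G and T (or U); the function name is hypothetical,
and the sketch says nothing about which target region, oligonucleotide length or chemistry would
be effective in practice.

```python
# Illustrative sketch only: antisense strand for a target written 5'->3'.

DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_dna(target):
    """Return the DNA reverse complement of a 5'->3' target (U is read as T)."""
    sense = target.upper().replace("U", "T")
    unknown = set(sense) - set("ACGT")
    if unknown:
        raise ValueError(f"unsupported characters: {sorted(unknown)}")
    # Complement each base, then reverse so the result also reads 5'->3'.
    return sense.translate(DNA_COMPLEMENT)[::-1]
```

Applying the function twice returns the original sense sequence, which is a convenient sanity
check on any candidate antisense sequence.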

The present invention still further provides cells genetically engineered in vivo to express the
polynucleotides of the invention, wherein such polynucleotides are in operative association with a
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in
the cell. These methods can be used to increase or decrease the expression of the polynucleotides of
the present invention.

Knowledge of DNA sequences provided by the invention allows for modification of cells to
permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g., by
homologous recombination) to provide increased polypeptide expression by replacing, in whole or
in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is
operatively linked to the desired protein encoding sequences. See, for example, PCT International
Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and PCT
International Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous
promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired
protein coding sequence, amplification of the marker DNA by standard selection methods results in
co-amplification of the desired protein coding sequences in the cells.

In another embodiment of the present invention, cells and tissues may be engineered to
express an endogenous gene comprising the polynucleotides of the invention under the control of
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may
be replaced by homologous recombination. As described herein, gene targeting can be used to
replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene
or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations
of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or
protein produced may be replaced, removed, added, or otherwise modified by targeting. These
sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences
for enhancing or modifying transport or secretion properties of the protein, or other sequences
which alter or improve the function or stability of protein or RNA molecules.

The targeting event may be a simple insertion of the regulatory sequence, placing the gene
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory
element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the
targeting event may replace an existing element; for example, a tissue-specific enhancer can be
replaced by an enhancer that has broader or different cell-type specificity than the naturally
occurring elements. Here, the naturally occurring sequences are deleted and new sequences are
added. In all cases, the identification of the targeting event may be facilitated by the use of one or
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the
targeting event may also be facilitated by the use of one or more marker genes exhibiting the
property of negative selection, such that the negatively selectable marker is linked to the exogenous
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and
such that a correct homologous recombination event with sequences in the host cell genome does
not result in the stable integration of the negatively selectable marker. Markers useful for this
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial
xanthine-guanine phosphoribosyl-transferase (gpt) gene.
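
The logic of combining a selectable marker with a flanking negatively selectable marker can be
summarized in a small Python sketch. It is a deliberately simplified model: the class, the field
names and the three outcome labels are assumptions introduced for this example, and real
integration events are more varied than the three cases shown.

```python
# Illustrative sketch only: expected outcome under combined positive/negative selection.
from dataclasses import dataclass


@dataclass
class Clone:
    has_positive_marker: bool  # selectable marker contiguous with the targeting DNA
    has_negative_marker: bool  # e.g. HSV-TK flanking the targeting sequence


def classify(clone):
    """Classify an integration event from which markers were stably integrated."""
    if not clone.has_positive_marker:
        return "no integration"
    if clone.has_negative_marker:
        return "random integration"      # flanking marker carried in with the construct
    return "homologous recombination"    # flanking marker lost during correct targeting


def survives_double_selection(clone):
    """True only for clones that keep the positive marker and lack the negative one."""
    return classify(clone) == "homologous recombination"
```

In this model only clones that retained the contiguous marker and lost the flanking marker
survive both selections, which is the enrichment for correctly targeted cells described above.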

The gene targeting or gene activation techniques which can be used in accordance with this
aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to Chappel;
U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. PCT/US92/09627
(WO93/09222) by Selden et al.; and International Application No. PCT/US90/06436
(WO91/06667) by Skoultchi et al., each of which is incorporated by reference herein in its entirety.

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the
invention in vivo, one or more genes provided by the invention are either overexpressed or
inactivated in the germ line of animals using homologous recombination [Capecchi, Science
244:1288-1292 (1989)]. Animals in which the gene is overexpressed, under the regulatory
control of exogenous or endogenous promoter elements, are known as transgenic animals.
Animals in which an endogenous gene has been inactivated by homologous recombination are
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic
animals are useful to determine the roles polypeptides of the invention play in biological
processes, and preferably in disease states. Transgenic animals are useful as model systems to
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human
mammals, are produced using methods as described in U.S. Patent No. 5,489,743 and PCT
Publication No. WO94/28122, incorporated herein by reference.

Transgenic animals can be prepared wherein all or part of a promoter of the
polynucleotides of the invention is either activated or inactivated to alter the level of expression
of the polypeptides of the invention. Inactivation can be carried out using homologous
recombination methods described above. Activation can be achieved by supplementing or even
replacing the homologous promoter to provide for increased protein expression. The homologous
promoter can be supplemented by insertion of one or more heterologous enhancer elements
known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development,
through, e.g., homologous recombination or knock out strategies, of animals that fail to express
polypeptides of the invention or that express a variant polypeptide. Such animals are useful as
models for studying the in vivo activities of polypeptide as well as for studying modulators of the
polypeptides of the invention.


4.10 USES AND BIOLOGICAL ACTIVITY

The polynucleotides and proteins of the present invention are expected to exhibit one or
more of the uses or biological activities (including those associated with assays cited herein)
identified herein. Uses or activities described for proteins of the present invention may be
provided by administration or use of such proteins or of polynucleotides encoding such proteins
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The
mechanism underlying the particular condition or pathology will dictate whether the
polypeptides of the invention, the polynucleotides of the invention or modulators (activators or
inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic
compositions of the invention" include compositions comprising isolated polynucleotides
(including recombinant DNA molecules, cloned genes and degenerate variants thereof) or
polypeptides of the invention (including full length protein, mature protein and truncations or
domains thereof), or compounds and other substances that modulate the overall activity of the
target gene products, either at the level of target gene/protein expression or target protein
activity. Such modulators include polypeptides, analogs (variants), including fragments and
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or
indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening
assays as described herein); antisense polynucleotides and polynucleotides suitable for triple
helix formation; and in particular antibodies or other binding partners that specifically recognize
one or more epitopes of the polypeptides of the invention.

The polypeptides of the present invention may likewise be involved in cellular activation
or in one of the other physiological pathways described herein.

4.10.1 RESEARCH USES AND UTILITIES

The polynucleotides provided by the present invention can be used by the research
community for various purposes. The polynucleotides can be used to express recombinant
protein for analysis, characterization or therapeutic use; as markers for tissues in which the
corresponding protein is preferentially expressed (either constitutively or at a particular stage of
tissue differentiation or development or in disease states); as molecular weight markers on gels;
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene
positions; to compare with endogenous DNA sequences in patients to identify potential genetic
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known
sequences in the process of discovering other novel polynucleotides; for selecting and making
oligomers for attachment to a "gene chip" or other support, including for examination of
expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for
example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap
assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of
the binding interaction.
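
Where a polynucleotide sequence is used to derive PCR primers or hybridization probes, two
quick checks on a candidate oligonucleotide are its G+C content and an approximate melting
temperature. The Python sketch below uses the common rule of thumb of 2 degrees C per A or T
and 4 degrees C per G or C, which is only a rough guide for short oligonucleotides. The function
names are hypothetical, and no particular primer length or temperature is prescribed by the text
above.

```python
# Illustrative sketch only: rough checks on a candidate primer or probe.


def gc_fraction(oligo):
    """Fraction of G and C bases in the oligonucleotide."""
    seq = oligo.upper()
    if not seq:
        raise ValueError("empty oligonucleotide")
    return (seq.count("G") + seq.count("C")) / len(seq)


def rule_of_thumb_tm(oligo):
    """Approximate melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if not seq or at + gc != len(seq):
        raise ValueError("oligonucleotide must contain only A, C, G and T")
    return 2 * at + 4 * gc
```

More accurate estimates depend on salt and oligonucleotide concentration and on
nearest-neighbor effects, none of which this approximation takes into account.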

The polypeptides provided by the present invention can similarly be used in assays to
determine biological activity, including in a panel of multiple proteins for high-throughput
screening; to raise antibodies or to elicit another immune response; as a reagent (including the
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its
receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or
development or in a disease state); and, of course, to isolate correlative receptors or ligands.
Proteins involved in these binding interactions can also be used to screen for peptide or small
molecule inhibitors or agonists of the binding interaction.

Any or all of these research utilities are capable of being developed into reagent grade or
kit format for commercialization as research products.

Methods for performing the uses listed above are well known to those skilled in the art.
References disclosing such methods include without limitation "Molecular Cloning: A
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987.

4.10.2 NUTRITIONAL USES

Polynucleotides and polypeptides of the present invention can also be used as nutritional
sources or supplements. Such uses include without limitation use as a protein or amino acid
supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a
particular organism or can be administered as a separate solid or liquid preparation, such as in the
form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the
polypeptide or polynucleotide of the invention can be added to the medium in or on which the
microorganism is cultured.

4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION
ACTIVITY

A polypeptide of the present invention may exhibit activity relating to cytokine, cell
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting)
activity or may induce production of other cytokines in certain cell populations. A
polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many
protein factors discovered to date, including all known cytokines, have exhibited activity in one
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient
confirmation of cytokine activity. The activity of therapeutic compositions of the present
invention is evidenced by any one of a number of routine factor dependent cell proliferation
assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3,
MC9/G, M+(preB M+), 2E8, RB5, DA1, 123, T1165, HT2, CTLL2, TF-1, Mo7e, CMK,
HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following:

Assays for T-cell or thymocyte proliferation include without limitation those described
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E.
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3,
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol.
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli,
et al., J. Immunol. 149:3778-3783, 1992; Bowman et al., J. Immunol. 152:1756-1761, 1994.

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation,
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan
eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse
and human interleukin-γ, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994.

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells
include, without limitation, those described in: Measurement of Human and Murine Interleukin 2
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991;
deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988;
Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse
and human interleukin 6 - Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Acad. Sci.
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11 - Bennett, F., Giannotti, J.,
Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp.
6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin
9 - Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology.
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991.

Assays for T-cell clone responses to antigens (which will identify, among others, proteins
that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and
cytokine production) include, without limitation, those described in: Current Protocols in
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober,
Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7,
Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 77:6091-6095,
1980; Weinberger et al., Eur. J. Immun. 11:405-411, 1981; Takai et al., J. Immunol.
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988.
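
Results from factor-dependent proliferation assays of the kind cited above are often summarized
as a ratio of the signal in treated wells to the signal in untreated control wells. The Python
sketch below shows one such summary. The readout (for example, counts from replicate wells),
the function name and the use of a simple ratio of means are assumptions made for illustration;
the cited protocols define their own readouts and controls, and no activity threshold is implied.

```python
# Illustrative sketch only: summarizing replicate wells from a proliferation assay.
from statistics import mean


def stimulation_index(treated_counts, control_counts):
    """Ratio of the mean signal in treated wells to the mean signal in control wells."""
    if not treated_counts or not control_counts:
        raise ValueError("both groups need at least one replicate")
    control_mean = mean(control_counts)
    if control_mean <= 0:
        raise ValueError("control mean must be positive")
    return mean(treated_counts) / control_mean
```

A value above 1 indicates more signal in the treated wells than in the controls, and a value
below 1 indicates less, consistent with a factor that either induces or inhibits proliferation.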

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY

A polypeptide of the present invention may exhibit stem cell growth factor activity and
be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem
cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or
ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential
state which would be useful for re-engineering damaged or diseased tissues, transplantation,
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce
large quantities of human cells has important working applications for the production of human
proteins which currently must be obtained from non-human sources or donors; implantation of
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases;
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs
for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung.

It is contemplated that multiple different exogenous growth factors and/or cytokines may
be administered in combination with the polypeptide of the invention to achieve the desired
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and
specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand
(Flt-3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage
inflammatory protein 1-alpha (MIP-1-alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic
fibroblast growth factor (bFGF).

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of
these cells in culture will facilitate the production of large quantities of mature cells. Techniques
for culturing stem cells are known in the art and administration of polypeptides of the invention,
optionally with other growth factors and/or cytokines, is expected to enhance the survival and
proliferation of the stem cell populations. This can be accomplished by direct administration of
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected
with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder
layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 

Stem cells themselves can be transfected with a polynucleotide of the invention to induce
autocrine expression of the polypeptide of the invention. This will allow for generation of
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be
differentiated into the desired mature cell types. These stable cell lines can also serve as a source
of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for
polymerase chain reaction experiments. These studies would allow for the isolation and
identification of differentially expressed genes in stem cell populations that regulate stem cell
proliferation and/or maintenance.

Expansion and maintenance of totipotent stem cell populations will be useful in the
treatment of many pathological conditions. For example, polypeptides of the present invention
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or
genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation
of neural cells and for the regeneration of nerve and brain tissue, i.e., for the treatment of central
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition,
the expanded stem cell populations can also be genetically altered for gene therapy purposes and
to decrease host rejection of replacement tissues after grafting or implantation.

Expression of the polypeptide of the invention and its effect on stem cells can also be
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell
types. A broadly applicable method of obtaining pure populations of a specific differentiated
cell type from undifferentiated stem cell populations involves the use of a cell-type specific
promoter driving a selectable marker. The selectable marker allows only cells of the desired type
to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus
et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998))
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al.,
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be
accomplished by culturing the stem cells in the presence of a differentiation factor such as
retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the
effects of endogenous stem cell factor activity and allow differentiation to proceed.

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell
sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder
layer, as described by Thompson et al., Proc. Natl. Acad. Sci. U.S.A., 92: 7844-7848 (1995), in
the presence of the polypeptide of the invention alone or in combination with other growth
factors or cytokines. The ability of the polypeptide of the invention to induce stem cell
proliferation is determined by colony formation on semi-solid support, e.g. as described by
Bernstein et al., Blood, 77: 2316-2321 (1991).

4.10.5 HEMATOPOIESIS REGULATING ACTIVITY

A polypeptide of the present invention may be involved in regulation of hematopoiesis
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal
biological activity in support of colony forming cells or of factor-dependent cell lines indicates
involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating
utility, for example, in treating various anemias or for use in conjunction with
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid
cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and
monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or
treatment of various platelet disorders such as thrombocytopenia, and generally for use in place
of or complementary to platelet transfusions; and/or in supporting the growth and proliferation of
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as
those usually treated with transplantation, including, without limitation, aplastic anemia and
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment
post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous))
as normal cells or genetically manipulated for gene therapy.

Therapeutic compositions of the invention can be used in the following: 

Suitable assays for proliferation and differentiation of various hematopoietic lines are
cited above.

Assays for embryonic stem cell differentiation (which will identify, among others,
proteins that influence embryonic differentiation hematopoiesis) include, without limitation,
those described in: Johansson et al., Cellular Biology 15:141-151, 1995; Keller et al., Molecular
and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81:2903-2915, 1993.

Assays for stem cell survival and differentiation (which will identify, among others,
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in:
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I.
Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al.,
Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive hematopoietic colony forming cells
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et
al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay,
Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21,
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I.
Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al.
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994.

4.10.6 TISSUE GROWTH ACTIVITY

A polypeptide of the present invention also may be involved in bone, cartilage, tendon,
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue
repair and replacement, and in healing of burns, incisions and ulcers.

A polypeptide of the present invention which induces cartilage and/or bone growth in
circumstances where bone is not normally formed has application in the healing of bone
fractures and cartilage damage or defects in humans and other animals. Compositions of a
polypeptide, antibody, binding partner, or other modulator of the invention may have
prophylactic use in closed as well as open fracture reduction and also in the improved fixation of
artificial joints. De novo bone formation induced by an osteogenic agent contributes to the
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and
also is useful in cosmetic plastic surgery.

A polypeptide of this invention may also be involved in attracting bone-forming cells,
stimulating growth of bone-forming cells, or inducing differentiation of progenitors of
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking
inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.)
mediated by inflammatory processes may also be possible using the composition of the
invention.

Another category of tissue regeneration activity that may involve the polypeptide of the
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or
other tissue formation in circumstances where such tissue is not normally formed has
application in the healing of tendon or ligament tears, deformities and other tendon or ligament
defects in humans and other animals. Such a preparation employing a tendon/ligament-like
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and
in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation
induced by a composition of the present invention contributes to the repair of congenital, trauma
induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic
surgery for attachment or repair of tendons or ligaments. The compositions of the present
invention may provide an environment to attract tendon- or ligament-forming cells, stimulate
growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or
ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for
return in vivo to effect tissue repair. The compositions of the invention may also be useful in the
treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The
compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is
well known in the art.

The compositions of the present invention may also be useful for proliferation of neural
cells and for regeneration of nerve and brain tissue, i.e., for the treatment of central and peripheral
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a
composition may be used in the treatment of diseases of the peripheral nervous system, such as
peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous
system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in
accordance with the present invention include mechanical and traumatic disorders, such as spinal
cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies
resulting from chemotherapy or other medical therapies may also be treatable using a
composition of the invention.

Compositions of the invention may also be useful to promote better or faster closure of
non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular
insufficiency, surgical and traumatic wounds, and the like.

Compositions of the present invention may also be involved in the generation or
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine,
kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the
desired effects may be by inhibition or modulation of fibrotic scarring to allow normal tissue
to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity.

A composition of the present invention may also be useful for gut protection or
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and
conditions resulting from systemic cytokine damage.

A composition of the present invention may also be useful for promoting or inhibiting
differentiation of tissues described above from precursor tissues or cells; or for inhibiting the
growth of tissues described above.

Therapeutic compositions of the invention can be used in the following: 

Assays for tissue generation activity include, without limitation, those described in:
International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No.
WO91/07491 (skin, endothelium).

Assays for wound healing activity include, without limitation, those described in: Winter,
Epidermal Wound Healing, pp. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol.
71:382-84 (1978).


4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 
A polypeptide of the present invention may also exhibit immune stimulating or immune
suppressing activity, including without limitation the activities for which assays are described
herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A
protein may be useful in the treatment of various immune deficiencies and disorders (including
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells
and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g.,
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More
specifically, infectious diseases caused by viral, bacterial, fungal or other infection may be
treatable using a protein of the present invention, including infections by HIV, hepatitis viruses,
herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful
where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer.

Autoimmune disorders which may be treated using a protein of the present invention
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus,
rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome,
autoimmune thyroiditis, insulin dependent diabetes mellitus, myasthenia gravis, graft-versus-host
disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof,
including antibodies) of the present invention may also be useful in the treatment of allergic
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect
venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria,
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme,
Stevens-Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma
(particularly allergic asthma) or other respiratory problems. Other conditions in which immune
suppression is desired (including, for example, organ transplantation) may also be treatable
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animal
models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66,
1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization
test (Vohr et al., Arch. Toxicol. 73: 501-9), and murine local lymph node assay (Kimber et al.,
J. Toxicol. Environ. Health 53: 563-79).

Using the proteins of the invention it may also be possible to modulate immune
responses in a number of ways. Down regulation may be in the form of inhibiting or blocking an
immune response already in progress or may involve preventing the induction of an immune
response. The functions of activated T cells may be inhibited by suppressing T cell responses or
by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is
generally an active, non-antigen-specific process which requires continuous exposure of the T
cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and
persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be
demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence
of the tolerizing agent.

Down regulating or preventing one or more antigen functions (including without
limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell
function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue
transplants, rejection of the transplant is initiated through its recognition as foreign by T cells,
followed by an immune reaction that destroys the transplant. The administration of a therapeutic
composition of the invention may prevent cytokine synthesis by immune cells, such as T cells,
and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient
to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it
may also be necessary to block the function of a combination of B lymphocyte antigens.

The efficacy of particular therapeutic compositions in preventing organ transplant
rejection or GVHD can be assessed using animal models that are predictive of efficacy in
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in
rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et
al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci. USA, 89:11102-11105
(1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven
Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic
compositions of the invention on the development of that disease.

Blocking antigen function may also be therapeutically useful for treating autoimmune
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are
reactive against self tissue and which promote the production of cytokines and autoantibodies
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may
reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T
cell-derived cytokines which may be involved in the disease process. Additionally, blocking
reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to
long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating
autoimmune disorders can be determined using a number of well-characterized animal models of
human autoimmune diseases. Examples include murine experimental autoimmune encephalitis,
systemic lupus erythematosus in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental
myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp.
840-856).

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means
of up regulating immune responses, may also be useful in therapy. Upregulation of immune
responses may be in the form of enhancing an existing immune response or eliciting an initial
immune response. For example, enhancing an immune response may be useful in cases of viral
infection, including systemic viral diseases such as influenza, the common cold, and
encephalitis.

Alternatively, anti-viral immune responses may be enhanced in an infected patient by
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed
APCs either expressing a peptide of the present invention or together with a stimulatory form of
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the
patient. Another method of enhancing anti-viral immune responses would be to isolate infected
cells from a patient, transfect them with a nucleic acid encoding a protein of the present
invention as described herein such that the cells express all or a portion of the protein on their
surface, and reintroduce the transfected cells into the patient. The infected cells would now be
capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo.

A polypeptide of the present invention may provide the necessary stimulation signal to T
cells to induce a T cell mediated immune response against the transfected tumor cells. In
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to
reexpress sufficient amounts of MHC class I or MHC class II molecules, can be transfected with
nucleic acid encoding all or a portion (e.g., a cytoplasmic-domain truncated portion) of an
MHC class I alpha chain protein and β2 microglobulin protein or an MHC class II alpha chain
protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II
proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding
an antisense construct which blocks expression of an MHC class II associated protein, such as
the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human
subject may be sufficient to overcome tumor-specific tolerance in the subject.

The activity of a protein of the invention may, among other means, be measured by the 
following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation,
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D.
H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and
Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19;
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA
78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J.
Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J.
Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al.,
Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994.

Assays for T-cell-dependent immunoglobulin responses and isotype switching (which
will identify, among others, proteins that modulate T-cell dependent antibody responses and that
affect Th1/Th2 profiles) include, without limitation, those described in: Maliszewski, J.
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production,
Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994.

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins
that generate predominantly Th1 and CTL responses) include, without limitation, those described
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E.
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3,
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512,
1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992.

Dendritic cell-dependent assays (which will identify, among others, proteins expressed
by dendritic cells that activate naive T-cells) include, without limitation, those described in:
Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine
173:549-559, 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et
al., Journal of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology
67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation
94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990.

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry
13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research
53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International
Journal of Oncology 1:639-648, 1992.

Assays for proteins that influence early steps of T-cell commitment and development
include, without limitation, those described in: Antica et al., Blood 84:111-117, 1994; Fine et al.,
Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al.,
Proc. Nat. Acad. Sci. USA 88:7548-7551, 1991.

4.10.8 ACTIVIN/INHIBIN ACTIVITY

A polypeptide of the present invention may also exhibit activin- or inhibin-related
activities. A polynucleotide of the invention may encode a polypeptide exhibiting such
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle
stimulating hormone (FSH), while activins are characterized by their ability to stimulate the
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention,
alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive
based on the ability of inhibins to decrease fertility in female mammals and decrease
spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a
homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A
polypeptide of the invention may also be useful for advancement of the onset of fertility in
sexually immature mammals, so as to increase the lifetime reproductive performance of domestic
animals such as, but not limited to, cows, sheep and pigs.

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 

Assays for activin/inhibin activity include, without limitation, those described in: Vale et
al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al., Nature
321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci.
USA 83:3091-3095, 1986.

4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY

A polypeptide of the present invention may be involved in chemotactic or chemokinetic
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils,
T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic
receptor activation can be used to mobilize or attract a desired cell population to a desired site of
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or
modulators of the invention) provide particular advantages in treatment of wounds and other
trauma to tissues, as well as in treatment of localized infections. For example, attraction of
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved
immune responses against the tumor or infecting agent.

A protein or peptide has chemotactic activity for a particular cell population if it can
stimulate, directly or indirectly, the directed orientation or movement of such cell population.
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells.
Whether a particular protein has chemotactic activity for a population of cells can be readily
determined by employing such protein or peptide in any known assay for cell chemotaxis.

Therapeutic compositions of the invention can be used in the following:

Assays for chemotactic activity (which will identify proteins that induce or prevent
chemotaxis) consist of assays that measure the ability of a protein to induce the migration of
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell
population to another cell population. Suitable assays for movement and adhesion include,
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.
M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates
and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines
6.12.1-6.12.28); Taub et al., J. Clin. Invest. 95:1370-1376, 1995; Lind et al., APMIS 103:140-146,
1995; Muller et al., Eur. J. Immunol. 25:1744-1748; Gruber et al., J. of Immunol. 152:5860-5867,
1994; Johnston et al., J. of Immunol. 153:1762-1768, 1994.

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY

A polypeptide of the invention may also be involved in hemostasis or thrombolysis or
thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such
attributes. Compositions may be useful in treatment of various coagulation disorders (including
hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events
in treating wounds resulting from trauma, surgery or other causes. A composition of the
invention may also be useful for dissolving or inhibiting formation of thromboses and for
treatment and prevention of conditions resulting therefrom (such as, for example, infarction of
cardiac and central nervous system vessels (e.g., stroke)).

Therapeutic compositions of the invention can be used in the following:

Assays for hemostatic and thrombolytic activity include, without limitation, those
described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res.
45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins
35:467-474, 1988.

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the
invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For
example, the presence or increased expression of a polynucleotide/polypeptide of the invention
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy.
Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer
condition. Identification of single nucleotide polymorphisms associated with cancer or a
predisposition to cancer may also be useful for diagnosis or prognosis.

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation,
inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth)
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic
compositions of the invention may be effective in adult and pediatric oncology including in solid
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic
cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma,
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer,
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal
cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps
associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle,
kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors,
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma,
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma,
hemangiopericytoma and Kaposi's sarcoma.

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be
administered to treat cancer. Therapeutic compositions can be administered in therapeutically
effective dosages alone or in combination with adjuvant cancer therapy such as surgery,
chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise
improving overall clinical condition, without necessarily eradicating the cancer.

The composition can also be administered in therapeutically effective amounts as a
portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or
modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine.
Anti-cancer drugs that are well known in the art and can be used as a treatment in combination
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide,
Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin
(cis-DDP), Cyclophosphamide, Cytarabine HCl (Cytosine arabinoside), Dacarbazine, Dactinomycin,
Daunorubicin HCl, Doxorubicin HCl, Estramustine phosphate sodium, Etoposide (V16-213),
Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide,
Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog),
Lomustine, Mechlorethamine HCl (nitrogen mustard), Melphalan, Mercaptopurine, Mesna,
Methotrexate (MTX), Mitomycin, Mitoxantrone HCl, Octreotide, Plicamycin, Procarbazine HCl,
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate,
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin,
Semustine, Teniposide, and Vindesine sulfate.

In addition, therapeutic compositions of the invention may be used for prophylactic
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g.
exposure to carcinogens) known in the art that predispose an individual to developing cancers.
Under these circumstances, it may be beneficial to treat these individuals with therapeutically
effective doses of the polypeptide of the invention to reduce the risk of developing cancers.

In vitro models can be used to determine the effective doses of the polypeptide of the
invention as a potential cancer treatment. These in vitro models include proliferation assays of
cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of
Animal Cells: A Manual of Basic Technique, Wiley-Liss, New York, NY Ch 18 and Ch 21),
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in
Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1189-97 (1999) and Li et al.,
Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cell lines are available,
e.g. from American Type Tissue Culture Collection catalogs.

4.10.12 RECEPTOR/LIGAND ACTIVITY

A polypeptide of the present invention may also demonstrate activity as receptor,
receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors
and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and
their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions
and their ligands (including without limitation, cellular adhesion molecules (such as selectins,
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen
recognition and development of cellular and humoral immune responses). Receptors and ligands
are also useful for screening of potential peptide or small molecule inhibitors of the relevant
receptor/ligand interaction. A protein of the present invention (including, without limitation,
fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand
interactions.

The activity of a polypeptide of the invention may, among other means, be measured by
the following methods:

Suitable assays for receptor-ligand activity include without limitation those described in:
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M.
Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 7.28,
Measurement of Cellular Adhesion under static conditions 7.28.1-7.28.22), Takai et al., Proc.
Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988;
Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods
175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995.

By way of example, the polypeptides of the invention may be used as a receptor for a
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified
through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel
overlay assays, or other methods known in the art.

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a
partial antagonist require the use of other proteins as competing ligands. The polypeptides of the
present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes,
colorimetric molecules or toxin molecules by conventional methods. ("Guide to Protein
Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic
Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and
carbon-14. Examples of colorimetric molecules include, but are not limited to, fluorescent
molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of
toxins include, but are not limited to, ricin.

4.10.13 DRUG SCREENING 

This invention is particularly useful for screening chemical compounds by using the
novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques.
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a
solid support, borne on a cell surface or located intracellularly. One method of drug screening
utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant
nucleic acids expressing the polypeptide or a fragment thereof. Drugs are screened against such
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can
be used for standard binding assays. One may measure, for example, the formation of
complexes between polypeptides of the invention or fragments and the agent being tested or
examine the diminution in complex formation between the novel polypeptides and an
appropriate cell line, which are well known in the art.
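
As an illustrative sketch only, the diminution in complex formation measured in a competitive
binding assay can be expressed as a percent inhibition relative to an agent-free control. The
function names, the background-subtraction step and the 50% hit threshold below are assumptions
made for this example; they are not taken from the disclosure.

```python
def percent_inhibition(signal_with_agent, signal_no_agent, background=0.0):
    """Percent reduction in complex formation caused by a test agent.

    All three readings are assumed to be in the same arbitrary units
    (for example, bound label counts from a binding assay).
    """
    specific_control = signal_no_agent - background
    if specific_control <= 0:
        raise ValueError("control signal must exceed background")
    specific_test = signal_with_agent - background
    return 100.0 * (1.0 - specific_test / specific_control)


def rank_hits(readings, signal_no_agent, background=0.0, threshold=50.0):
    """Return (agent, inhibition) pairs at or above threshold, strongest first.

    `readings` maps a hypothetical agent name to its assay signal.
    """
    scored = [
        (name, percent_inhibition(signal, signal_no_agent, background))
        for name, signal in readings.items()
    ]
    hits = [(name, value) for name, value in scored if value >= threshold]
    return sorted(hits, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Invented readings for three hypothetical test agents.
    example = {"agent_A": 190.0, "agent_B": 910.0, "agent_C": 550.0}
    for name, value in rank_hits(example, signal_no_agent=1000.0, background=100.0):
        print(f"{name}: {value:.1f}% inhibition")
```

A threshold on percent inhibition is only one possible way to call a "hit"; a full
dose-response titration would normally follow for any agent that passes it.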

Sources for test compounds that may be screened for ability to bind to or modulate (i.e.,
increase or decrease) the activity of polypeptides of the invention include (1) inorganic and
organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries
comprised of either random or mimetic peptides, oligonucleotides or organic molecules.

Chemical libraries may be readily synthesized or purchased from a number of
commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and
fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for
screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a
review, see Science 252:63-68 (1998).

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or
organic compounds and can be readily prepared by traditional automated synthesis methods,
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and
oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein,
peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries.
For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin.
Biotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see
Al-Obeidi et al., Mol. Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol,
1(1):114-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides).

Identification of modulators through use of the various libraries described herein permits
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a
polypeptide of the invention. The molecules identified in the binding assay are then tested for
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the
art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested
for either cell/animal death or prolonged survival of the animal/cells.

The binding molecules thus identified may be complexed with toxins, e.g., ricin or
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding
molecule complex is then targeted to a tumor or other cell by the specificity of the binding
molecule for a polypeptide of the invention. Alternatively, the binding molecules may be
complexed with imaging agents for targeting and imaging purposes.

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

The invention also provides methods to detect specific binding of a polypeptide e.g. a
ligand or a receptor. The art provides numerous assays particularly useful for identifying
previously unknown binding partners for receptor polypeptides of the invention. For example,
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used
to identify polynucleotides encoding binding partners. As another example, affinity
chromatography with the appropriate immobilized polypeptide of the invention can be used to
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number
of different libraries used for the identification of compounds, and in particular small molecules,
that modulate (i.e., increase or decrease) biological activity of a polypeptide of the invention.
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous
ligands, or cocktails of ligands to two cell populations that are genetically identical except for
the expression of the receptor of the invention: one cell population expresses the receptor of the
invention whereas the other does not. The responses of the two cell populations to the addition of
ligand(s) are then compared. Alternatively, an expression library can be co-expressed with the
polypeptide of the invention in cells and assayed for an autocrine response to identify potential
ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known
in the art can be used to identify binding partner polypeptides, including, (1) organic and
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries
comprised of random peptides, oligonucleotides or organic molecules.
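
The comparison of two otherwise identical cell populations described above can be sketched as a
fold-difference calculation. This is a hypothetical illustration: the response units, the
ligand names and the two-fold cutoff are assumptions, not values taken from the disclosure.

```python
def receptor_dependent_ligands(responses, min_fold=2.0):
    """Flag candidate ligands whose response depends on the receptor.

    `responses` maps a ligand name to a pair of measurements:
    (response in receptor-expressing cells, response in receptor-negative cells).
    A ligand is reported when the first value is at least `min_fold` times
    the second. Results are sorted by fold difference, largest first.
    """
    hits = []
    for ligand, (with_receptor, without_receptor) in responses.items():
        if without_receptor <= 0:
            # No measurable response without the receptor: treat any
            # positive response with the receptor as an unbounded fold change.
            fold = float("inf") if with_receptor > 0 else 0.0
        else:
            fold = with_receptor / without_receptor
        if fold >= min_fold:
            hits.append((ligand, fold))
    return sorted(hits, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Invented measurements for three hypothetical ligands.
    measured = {
        "ligand_1": (8.0, 2.0),
        "ligand_2": (3.0, 2.9),
        "ligand_3": (6.0, 1.0),
    }
    for ligand, fold in receptor_dependent_ligands(measured):
        print(f"{ligand}: {fold:.1f}-fold")
```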

The role of downstream intracellular signaling molecules in the signaling cascade of the
polypeptide of the invention can be determined. For example, a chimeric protein in which the
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a
protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating
the chimeric receptor. Known downstream proteins involved in intracellular signaling can then
be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the
art can also be used to identify signaling molecules involved in receptor activity.

4.10.15 ANTI-INFLAMMATORY ACTIVITY

Compositions of the present invention may also exhibit anti-inflammatory activity. The
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the
inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example,
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production
of other factors which more directly inhibit or promote an inflammatory response. Compositions
with such activities can be used to treat inflammatory conditions (including chronic or acute
conditions), including without limitation inflammation associated with infection (such as septic
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury,
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from
over production of cytokines such as TNF or IL-1. Compositions of the invention may also be
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material.
Compositions of this invention may be utilized to prevent or treat conditions such as, but not
limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1,
graft versus host disease, inflammatory bowel disease, inflammation associated with pulmonary
disease, other autoimmune disease or inflammatory disease, or may be used as an antiproliferative
agent, such as for acute or chronic myelogenous leukemia, or in the prevention of premature labor
secondary to intrauterine infections.

4.10.16 LEUKEMIAS 

Leukemias and related disorders may be treated or prevented by administration of a
therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the
invention. Such leukemias and related disorders include but are not limited to acute leukemia,
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic,
myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic
(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see
Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia).

4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of
intervention with compounds that modulate the activity of the polynucleotides and/or
polypeptides of the invention, and which can be treated upon thus observing an indication of
therapeutic utility, include but are not limited to nervous system injuries, and diseases or
disorders which result in either a disconnection of axons, a diminution or degeneration of
neurons, or demyelination. Nervous system lesions which may be treated in a patient (including
human and non-human mammalian patients) according to the invention include but are not
limited to the following lesions of either the central (including spinal cord, brain) or peripheral
nervous systems:

(i) traumatic lesions, including lesions caused by physical injury or associated with
surgery, for example, lesions which sever a portion of the nervous system, or compression
injuries;

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord
infarction or ischemia;

(iii) infectious lesions, in which a portion of the nervous system is destroyed or
injured as a result of infection, for example, by an abscess or associated with infection by human
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease,
tuberculosis, syphilis;

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or
injured as a result of a degenerative process including but not limited to degeneration associated
with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral
sclerosis;

(v) lesions associated with nutritional diseases or disorders, in which a portion of the
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease,
tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus
callosum), and alcoholic cerebellar degeneration;

(vi) neurological lesions associated with systemic diseases including but not limited to
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or
sarcoidosis;

(vii) lesions caused by toxic substances including alcohol, lead, or particular
neurotoxins; and

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or
injured by a demyelinating disease including but not limited to multiple sclerosis, human
immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies,
progressive multifocal leukoencephalopathy, and central pontine myelinolysis.

Therapeutics which are useful according to the invention for treatment of a nervous
system disorder may be selected by testing for biological activity in promoting the survival or
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit
any of the following effects may be useful according to the invention:

(i) increased survival time of neurons in culture;

(ii) increased sprouting of neurons in culture or in vivo;

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g.,
choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or

(iv) decreased symptoms of neuron dysfunction in vivo.

Such effects may be measured by any method known in the art. In preferred,
non-limiting embodiments, increased survival of neurons may be measured by the method set
forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may
be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al.
(1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may
be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc.,
depending on the molecule to be measured; and motor neuron dysfunction may be measured by
assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron
conduction velocity, or functional disability.

In specific embodiments, motor neuron disorders that may be treated according to the
invention include but are not limited to disorders such as infarction, infection, exposure to toxin,
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as
well as other components of the nervous system, as well as disorders that selectively affect
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal
muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile
muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome),
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy
(Charcot-Marie-Tooth Disease).

4.10.18 OTHER ACTIVITIES

A polypeptide of the invention may also exhibit one or more of the following additional
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents,
including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape
(such as, for example, breast augmentation or diminution, change in bone form or shape);
effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female
subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other
nutritional factors or component(s); effecting behavioral characteristics, including, without
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other
than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting
deficiencies of the enzyme and treating deficiency-related diseases; treatment of
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen
in a vaccine composition to raise an immune response against such protein or another material or
entity which is cross-reactive with such protein.

4.10.19 IDENTIFICATION OF POLYMORPHISMS

The demonstration of polymorphisms makes possible the identification of such
polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis
and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or
susceptibility to various disease states (such as disorders involving inflammation or immune
response) or a differential response to drug administration, and this genetic information can be
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a
polymorphism associated with a predisposition to inflammation or autoimmune disease makes
possible the diagnosis of this condition in humans by identifying the presence of the
polymorphism.

Polymorphisms can be identified in a variety of ways known in the art which all
generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally
involving isolation or amplification of the DNA, and identifying the presence of the
polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a
single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately
adjacent to the position of the polymorphism is extended with one or more labeled nucleotides).
In addition, traditional restriction fragment length polymorphism analysis (using restriction
enzymes that provide differential digestion of the genomic DNA depending on the presence or
absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the
present invention can be used to detect polymorphisms. The array can comprise modified
nucleotide sequences of the present invention in order to detect the nucleotide sequences of the
present invention. In the alternative, any one of the nucleotide sequences of the present
invention can be placed on the array to detect changes from those sequences.
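
The restriction fragment length polymorphism analysis mentioned above can be illustrated with
a small in-silico digest. This is a minimal sketch under stated assumptions: EcoRI (recognition
sequence GAATTC, cutting after the first G) is used only as a familiar example enzyme, both
sequences are invented, and only the given strand of a linear fragment is scanned, which is
sufficient for a palindromic site.

```python
def digest(sequence, site, cut_offset):
    """Return fragment lengths after cutting a linear DNA at every `site`.

    `cut_offset` is the number of bases from the start of the recognition
    site to the cut position on the scanned strand.
    """
    sequence = sequence.upper()
    site = site.upper()
    cut_positions = []
    start = sequence.find(site)
    while start != -1:
        cut_positions.append(start + cut_offset)
        start = sequence.find(site, start + 1)

    fragments = []
    previous = 0
    for position in cut_positions:
        fragments.append(position - previous)
        previous = position
    fragments.append(len(sequence) - previous)
    return fragments


def differs_by_digest(reference, sample, site="GAATTC", cut_offset=1):
    """True when the two sequences give different fragment patterns."""
    return digest(reference, site, cut_offset) != digest(sample, site, cut_offset)


if __name__ == "__main__":
    reference = "ATGCGAATTCTTAGGC"  # invented amplicon containing one EcoRI site
    variant = "ATGCGAGTTCTTAGGC"    # single base change that removes the site
    print(digest(reference, "GAATTC", 1))
    print(digest(variant, "GAATTC", 1))
    print(differs_by_digest(reference, variant))
```

A polymorphism that neither creates nor destroys a recognition site would not be seen by this
approach, which is one reason the text also lists sequencing and hybridization methods.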

Alternatively a polymorphism resulting in a change in the amino acid sequence could
also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g.,
by an antibody specific to the variant sequence.

4.10.20 ARTHRITIS AND INFLAMMATION

The immunosuppressive effects of the compositions of the invention against rheumatoid
arthritis are determined in an experimental animal model system. The experimental model system
is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et al., 1983,
Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 23:129.
Induction of the disease can be caused by a single injection, generally intradermally, of a
suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant
mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about
1-5 mg/kg. The control consists of administering PBS only.

The procedure for testing the effects of the test compound would consist of intradermally
injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as
described by J. Holoshitz above. An analysis of the data would reveal that the test compound
would have a dramatic effect on the swelling of the joints as measured by a decrease of the
arthritis score.
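
The dosing and scoring schedule described above can be laid out as a short sketch. Treating the
day of adjuvant injection as day 0, and summarizing the effect as the difference between
control and treated group means, are assumptions made for this illustration; the scores shown
are invented placeholders, not experimental results.

```python
from statistics import mean

# Days after adjuvant injection on which an arthritis score is recorded.
SCORING_DAYS = (14, 15, 18, 20, 22, 24)


def dosing_days(last_day=24, interval=2):
    """Days on which the test compound is given: day 0, then every other day."""
    return list(range(0, last_day + 1, interval))


def mean_score_reduction(control_scores, treated_scores):
    """Control mean minus treated mean for each scoring day present in both.

    Each argument maps a day to a list of per-animal arthritis scores.
    A positive value means the treated group scored lower than the PBS control.
    """
    reduction = {}
    for day in SCORING_DAYS:
        if day in control_scores and day in treated_scores:
            reduction[day] = mean(control_scores[day]) - mean(treated_scores[day])
    return reduction


if __name__ == "__main__":
    print(dosing_days())
    # Placeholder scores for two animals per group on two scoring days.
    control = {14: [8, 10], 15: [10, 12]}
    treated = {14: [4, 6], 15: [5, 7]}
    print(mean_score_reduction(control, treated))
```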

4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or
other binding partners or modulators including antisense polynucleotides) of the invention have
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications
include, but are not limited to, those exemplified herein.

4.11.1 EXAMPLE

One embodiment of the invention is the administration of an effective amount of the
polypeptides or other composition of the invention to individuals affected by a disease or
disorder that can be modulated by regulating the peptides of the invention. While the mode of
administration is not particularly important, parenteral administration is preferred. An
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the
polypeptides or other composition of the invention will normally be determined by the
prescribing physician. It is to be expected that the dosage will vary according to the age, weight,
condition and response of the individual patient. Typically, the amount of polypeptide
administered per dose will be in the range of about 0.01 µg/kg to 100 mg/kg of body weight, with
the preferred dose being about 0.1 µg/kg to 10 mg/kg of patient body weight. For parenteral
administration, polypeptides of the invention will be formulated in an injectable form combined
with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting
of small amounts of the human serum albumin. The vehicle may contain minor amounts of
additives that maintain the isotonicity and stability of the polypeptide or other active ingredient.
The preparation of such solutions is within the skill of the art.
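
The per-kilogram ranges stated above translate into per-dose amounts by simple multiplication
with body weight. The sketch below shows only that unit arithmetic; the function name and the
70 kg example weight are assumptions for illustration, and, as the text states, the actual
dosage is determined by the prescribing physician.

```python
MICROGRAMS_PER_MILLIGRAM = 1000.0

# Ranges from the text, expressed in micrograms per kilogram of body weight.
BROAD_RANGE_UG_PER_KG = (0.01, 100.0 * MICROGRAMS_PER_MILLIGRAM)     # 0.01 µg/kg to 100 mg/kg
PREFERRED_RANGE_UG_PER_KG = (0.1, 10.0 * MICROGRAMS_PER_MILLIGRAM)   # 0.1 µg/kg to 10 mg/kg


def dose_bounds_ug(body_weight_kg, range_ug_per_kg):
    """Return (lowest, highest) amount per dose in micrograms for one patient."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low_per_kg, high_per_kg = range_ug_per_kg
    return (low_per_kg * body_weight_kg, high_per_kg * body_weight_kg)


if __name__ == "__main__":
    # Hypothetical 70 kg patient.
    low, high = dose_bounds_ug(70.0, PREFERRED_RANGE_UG_PER_KG)
    print(f"preferred range: {low:.3g} µg to {high / MICROGRAMS_PER_MILLIGRAM:.3g} mg per dose")
```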

4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

A protein or other composition of the present invention (from whatever source derived,
including without limitation from recombinant and non-recombinant sources and including
antibodies and other binding partners of the polypeptides of the invention) may be administered
to a patient in need, by itself or in pharmaceutical compositions where it is mixed with suitable
carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents,
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the
effectiveness of the biological activity of the active ingredient(s). The characteristics of the
carrier will depend on the route of administration. The pharmaceutical composition of the
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as
M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12,
IL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell
factor, and erythropoietin. In further compositions, proteins of the invention may be combined
with other agents beneficial to the treatment of the disease or disorder in question. These agents
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth
factor (PDGF), transforming growth factors (TGF-α and TGF-β), insulin-like growth factor
(IGF), as well as cytokines described herein.

The pharmaceutical composition may further contain other agents which either enhance
the activity of the protein or other active ingredient or complement its activity or use in
treatment. Such additional factors and/or agents may be included in the pharmaceutical
composition to produce a synergistic effect with protein or other active ingredient of the
invention, or to minimize side effects. Conversely, protein or other active ingredient of the
present invention may be included in formulations of the particular clotting factor, cytokine,
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or
anti-inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as
IL-1Ra, IL-1 Hy1, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein
of the present invention may be active in multimers (e.g., heterodimers or homodimers) or
complexes with itself or other proteins. As a result, pharmaceutical compositions of the
invention may comprise a protein of the invention in such multimeric or complexed form.

As an alternative to being included in a pharmaceutical composition of the invention
including a first protein, a second protein or a therapeutic agent may be concurrently
administered with the first protein (e.g., at the same time, or at differing times provided that
therapeutic concentrations of the combination of agents are achieved at the treatment site).
Techniques for formulation and administration of the compounds of the instant application may
be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest
edition. A therapeutically effective dose further refers to that amount of the compound sufficient
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the
relevant medical condition, or an increase in rate of treatment, healing, prevention or
amelioration of such conditions. When applied to an individual active ingredient, administered
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a
combination, a therapeutically effective dose refers to combined amounts of the active
ingredients that result in the therapeutic effect, whether administered in combination, serially or
simultaneously.

In practicing the method of treatment or use of the present invention, a therapeutically
effective amount of protein or other active ingredient of the present invention is administered to
a mammal having a condition to be treated. Protein or other active ingredient of the present
invention may be administered in accordance with the method of the invention either alone or in
combination with other therapies such as treatments employing cytokines, lymphokines or other
hematopoietic factors. When co-administered with one or more cytokines, lymphokines or other
hematopoietic factors, protein or other active ingredient of the present invention may be
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic
factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially,
the attending physician will decide on the appropriate sequence of administering protein or other
active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other
hematopoietic factor(s), thrombolytic or anti-thrombotic factors.

4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or
intestinal administration; parenteral delivery, including intramuscular, subcutaneous,
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous,
intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active
ingredient of the present invention used in the pharmaceutical composition or to practice the
method of the present invention can be carried out in a variety of conventional ways, such as oral
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral
or intravenous injection. Intravenous administration to the patient is preferred.

Alternately, one may administer the compound in a local rather than systemic manner, for
example, via injection of the compound directly into arthritic joints or into fibrotic tissue, often
in a depot or sustained release formulation. In order to prevent the scarring process frequently
occurring as a complication of glaucoma surgery, the compounds may be administered topically,
for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery
system, for example, in a liposome coated with a specific antibody, targeting, for example,
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the
afflicted tissue.

The polypeptides of the invention are administered by any route that delivers an effective
dosage to the desired site of action. The determination of a suitable route of administration and
an effective dosage for a particular indication is within the level of skill in the art. Preferably for
wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage
ranges for the polypeptides of the invention can be extrapolated from these dosages or from
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the
clinician to provide maximal therapeutic benefit.

4.12.2 COMPOSITIONS/FORMULATIONS

Pharmaceutical compositions for use in accordance with the present invention thus may
be formulated in a conventional manner using one or more physiologically acceptable carriers
comprising excipients and auxiliaries which facilitate processing of the active compounds into
preparations which can be used pharmaceutically. These pharmaceutical compositions may be
manufactured in a manner that is itself known, e.g., by means of conventional mixing,
dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or
lyophilizing processes. Proper formulation is dependent upon the route of administration
chosen. When a therapeutically effective amount of protein or other active ingredient of the
present invention is administered orally, protein or other active ingredient of the present
invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to
95% protein or other active ingredient of the present invention, and preferably from about 25 to
90% protein or other active ingredient of the present invention. When administered in liquid
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil,
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the
pharmaceutical composition may further contain physiological saline solution, dextrose or other
saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol.
When administered in liquid form, the pharmaceutical composition contains from about 0.5 to
90% by weight of protein or other active ingredient of the present invention, and preferably from
about 1 to 50% protein or other active ingredient of the present invention.
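
The content ranges above can be read as a simple check on the loading of a candidate dosage
form. The sketch below is illustrative only: the grouping of tablet, capsule and powder under a
single "solid" label, the treatment of all percentages as weight percentages, and the names
`RANGES` and `classify_loading` are assumptions made for the example, and the "about" in the
text is ignored.

```python
# Percent ranges of protein or other active ingredient quoted in the text.
# "solid" groups tablet, capsule and powder (an assumption for illustration).
RANGES = {
    "solid": {"broad": (5.0, 95.0), "preferred": (25.0, 90.0)},
    "liquid": {"broad": (0.5, 90.0), "preferred": (1.0, 50.0)},
}


def classify_loading(form, active_mg, total_mg):
    """Classify the active-ingredient loading of a dosage form.

    Returns "preferred", "within range" or "outside range".
    """
    if total_mg <= 0 or not 0 <= active_mg <= total_mg:
        raise ValueError("need 0 <= active_mg <= total_mg and total_mg > 0")
    pct = 100.0 * active_mg / total_mg
    low, high = RANGES[form]["preferred"]
    if low <= pct <= high:
        return "preferred"
    low, high = RANGES[form]["broad"]
    if low <= pct <= high:
        return "within range"
    return "outside range"


if __name__ == "__main__":
    # A hypothetical 400 mg tablet carrying 40 mg of active ingredient (10%).
    print(classify_loading("solid", 40.0, 400.0))
```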

When a therapeutically effective amount of protein or other active ingredient of the
present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or
other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other
active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within
the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or
subcutaneous injection should contain, in addition to protein or other active ingredient of the
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection,
Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or
other vehicle as known in the art. The pharmaceutical composition of the present invention may
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of
skill in the art. For injection, the agents of the invention may be formulated in aqueous
solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's
solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate
to the barrier to be permeated are used in the formulation. Such penetrants are generally known
in the art.

For oral administration, the compounds can be formulated readily by combining the
active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules,
liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be
treated. Pharmaceutical preparations for oral use can be obtained from a solid excipient,
optionally grinding a resulting mixture, and processing the mixture of granules, after adding
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in
particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose
preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin,
gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic,
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be
added to the tablets or dragee coatings for identification or to characterize different combinations
of active compound doses.

Pharmaceutical preparations which can be used orally include push-fit capsules made of
gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or
sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and,
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in
suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition,
stabilizers may be added. All formulations for oral administration should be in dosages suitable
for such administration. For buccal administration, the compositions may take the form of
tablets or lozenges formulated in conventional manner.

For administration by inhalation, the compounds for use according to the present
invention are conveniently delivered in the form of an aerosol spray presentation from
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g.,
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or
other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by
providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use
in an inhaler or insufflator may be formulated containing a powder mix of the compound and a
suitable powder base such as lactose or starch. The compounds may be formulated for parenteral
administration by injection, e.g., by bolus injection or continuous infusion. Formulations for
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with
an added preservative. The compositions may take such forms as suspensions, solutions or
emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending,
stabilizing and/or dispersing agents.

Pharmaceutical formulations for parenteral administration include aqueous solutions of
the active compounds in water-soluble form. Additionally, suspensions of the active compounds
may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or
vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or
dextran. Optionally, the suspension may also contain suitable stabilizers or agents which
increase the solubility of the compounds to allow for the preparation of highly concentrated
solutions. Alternatively, the active ingredient may be in powder form for constitution with a
suitable vehicle, e.g., sterile pyrogen-free water, before use.

The compounds may also be formulated in rectal compositions such as suppositories or
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other
glycerides. In addition to the formulations described previously, the compounds may also be
formulated as a depot preparation. Such long acting formulations may be administered by
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection.
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as
sparingly soluble derivatives, for example, as a sparingly soluble salt.

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent
system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v
polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic
administration. Naturally, the proportions of a co-solvent system may be varied considerably
without destroying its solubility and toxicity characteristics. Furthermore, the identity of the
co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may
be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other
biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other
sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents
such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity.
Additionally, the compounds may be delivered using a sustained-release system, such as
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent.
Various types of sustained-release materials have been established and are well known by those
skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the
biological stability of the therapeutic reagent, additional strategies for protein or other active
ingredient stabilization may be employed.
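
The VPD proportions given above translate directly into component masses for a chosen batch
volume. The following is a minimal sketch of that arithmetic; it assumes that "1:1" means equal
volumes of VPD and 5% dextrose in water and that volumes are additive on mixing (only
approximately true for ethanol/water mixtures). The function and constant names are
hypothetical.

```python
# VPD stock composition from the text, in grams per 100 mL (% w/v).
# The remainder of the volume is absolute ethanol.
VPD_PERCENT_WV = {
    "benzyl alcohol": 3.0,
    "polysorbate 80": 8.0,
    "polyethylene glycol 300": 65.0,
}
DEXTROSE_PERCENT_WV = 5.0  # the "5W" diluent: 5% dextrose in water


def vpd_stock_masses_g(volume_ml):
    """Grams of each solute needed for volume_ml of VPD stock."""
    if volume_ml <= 0:
        raise ValueError("volume_ml must be positive")
    return {name: pct * volume_ml / 100.0 for name, pct in VPD_PERCENT_WV.items()}


def vpd_5w_percent_wv():
    """Nominal % w/v of each solute after a 1:1 (v/v) dilution with 5W.

    Assumes additive volumes, so every concentration is simply halved.
    """
    final = {name: pct / 2.0 for name, pct in VPD_PERCENT_WV.items()}
    final["dextrose"] = DEXTROSE_PERCENT_WV / 2.0
    return final


if __name__ == "__main__":
    for name, grams in vpd_stock_masses_g(250.0).items():
        print(f"{name}: {grams:.1f} g per 250 mL VPD")
    print(vpd_5w_percent_wv())
```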

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers
or excipients. Examples of such carriers or excipients include but are not limited to calcium
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be
provided as salts with pharmaceutically compatible counter ions. Such pharmaceutically
acceptable base addition salts are those salts which retain the biological effectiveness and
properties of the free acids and which are obtained by reaction with inorganic or organic bases
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine,
monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and
the like.

The pharmaceutical composition of the invention may be in the form of a complex of the
protein(s) or other active ingredient(s) of the present invention along with protein or peptide
antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following
presentation of the antigen by MHC proteins. MHC and structurally related proteins including
those encoded by class I and class II MHC genes on host cells will serve to present the peptide
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified
MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells.
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as
well as antibodies able to bind the TCR and other molecules on T cells can be combined with the
pharmaceutical composition of the invention.

The pharmaceutical composition of the invention may be in the form of a liposome in
which protein of the present invention is combined, in addition to other pharmaceutically
acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable
lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides,
sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such
liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S.
Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated
herein by reference.

The amount of protein or other active ingredient of the present invention in the
pharmaceutical composition of the present invention will depend upon the nature and severity of
the condition being treated, and on the nature of prior treatments which the patient has
undergone. Ultimately, the attending physician will decide the amount of protein or other active
ingredient of the present invention with which to treat each individual patient. Initially, the
attending physician will administer low doses of protein or other active ingredient of the present
invention and observe the patient's response. Larger doses of protein or other active ingredient
of the present invention may be administered until the optimal therapeutic effect is obtained for
the patient, and at that point the dosage is not increased further. It is contemplated that the
various pharmaceutical compositions used to practice the method of the present invention should
contain about 0.01 µg to about 100 mg (preferably about 0.1 µg to about 10 mg, more preferably
about 0.1 µg to about 1 mg) of protein or other active ingredient of the present invention per kg
body weight. For compositions of the present invention which are useful for bone, cartilage,
tendon or ligament regeneration, the therapeutic method includes administering the composition
topically, systemically, or locally as an implant or device. When administered, the therapeutic
composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable
form. Further, the composition may desirably be encapsulated or injected in a viscous form for
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other
active ingredient of the invention, which may also optionally be included in the composition as
described above, may alternatively or additionally be administered simultaneously or
sequentially with the composition in the methods of the invention. Preferably for bone and/or
cartilage formation, the composition would include a matrix capable of delivering the
protein-containing or other active ingredient-containing composition to the site of bone and/or
cartilage damage, providing a structure for the developing bone and cartilage and optimally
capable of being resorbed into the body. Such matrices may be formed of materials presently in
use for other implanted medical applications.

The choice of matrix material is based on biocompatibility, biodegradability, mechanical
properties, cosmetic appearance and interface properties. The particular application of the
compositions will define the appropriate formulation. Potential matrices for the compositions
may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate,
hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further
matrices are comprised of pure proteins or extracellular matrix components. Other potential
matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass,
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above
mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and
tricalcium phosphate. The bioceramics may be altered in composition, such as in
calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and
glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns.
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl
cellulose or autologous blood clot, to prevent the protein compositions from disassociating from
the matrix.

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose,
hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and
carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate,
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol).
The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on
total formulation weight, which represents the amount necessary to prevent desorption of the
protein from the polymer matrix and to provide appropriate handling of the composition, yet not
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the
protein the opportunity to assist the osteogenic activity of the progenitor cells. In further
compositions, proteins or other active ingredients of the invention may be combined with other
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in
question. These agents include various growth factors such as epidermal growth factor (EGF),
platelet derived growth factor (PDGF), transforming growth factors (TGF-α and TGF-β), and
insulin-like growth factor (IGF).
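
Because the sequestering-agent range above is stated relative to total formulation weight, the
agent's own mass counts toward the denominator. The sketch below works through that
arithmetic under the assumption that the mass of all other components (matrix, protein,
excipients) is already fixed; `sequestering_agent_mass_g` is a hypothetical name used only for
illustration.

```python
def sequestering_agent_mass_g(other_components_g, wt_percent):
    """Mass of sequestering agent that makes up wt_percent of the TOTAL weight.

    Solves  agent / (agent + other) = wt_percent / 100  for agent.
    Values outside the 0.5-20 wt % range quoted in the text are rejected.
    """
    if other_components_g <= 0:
        raise ValueError("other_components_g must be positive")
    if not 0.5 <= wt_percent <= 20.0:
        raise ValueError("wt_percent is outside the 0.5-20 wt % range in the text")
    w = wt_percent / 100.0
    return other_components_g * w / (1.0 - w)


if __name__ == "__main__":
    # 90 g of matrix plus protein, targeting 10 wt % agent in the final mix.
    agent = sequestering_agent_mass_g(90.0, 10.0)
    print(f"agent: {agent:.2f} g, total: {agent + 90.0:.2f} g")
```

Note that simply adding 10% of the other components' weight would give a lower loading (about
9.1 wt % in this example), which is why the total-weight basis matters.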

The therapeutic compositions are also presently valuable for veterinary applications.
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired
patients for such treatment with proteins or other active ingredients of the present invention. The
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue
regeneration will be determined by the attending physician considering various factors which
modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g.,
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution
and with inclusion of other proteins in the pharmaceutical composition. For example, the
addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final
composition, may also affect the dosage. Progress can be monitored by periodic assessment of
tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and
tetracycline labeling.

Polynucleotides of the present invention can also be used for gene therapy. Such
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a
mammalian subject. Polynucleotides of the invention may also be administered by other known
methods for introduction of nucleic acid into a cell or organism (including, without limitation, in
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of
proteins of the present invention in order to proliferate or to produce a desired effect on or
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes.

4.12.3 EFFECTIVE DOSAGE

Pharmaceutical compositions suitable for use in the present invention include
compositions wherein the active ingredients are contained in an effective amount to achieve its
intended purpose. More specifically, a therapeutically effective amount means an amount
effective to prevent development of or to alleviate the existing symptoms of the subject being
treated. Determination of the effective amount is well within the capability of those skilled in
the art, especially in light of the detailed disclosure provided herein. For any compound used in
the method of the invention, the therapeutically effective dose can be estimated initially from
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a
circulating concentration range that can be used to more accurately determine useful doses in
humans. For example, a dose can be formulated in animal models to achieve a circulating
concentration range that includes the IC50 as determined in cell culture (i.e., the concentration of
the test compound which achieves a half-maximal inhibition of the protein's biological activity).
Such information can be used to more accurately determine useful doses in humans.

A therapeutically effective dose refers to that amount of the compound that results in
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell
cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the
population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the
ratio between LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred.
The data obtained from these cell culture assays and animal studies can be used in formulating a
range of dosage for use in humans. The dosage of such compounds lies preferably within a range
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may
vary within this range depending upon the dosage form employed and the route of administration
utilized. The exact formulation, route of administration and dosage can be chosen by the
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The
Pharmacological Basis of Therapeutics", Ch. 1 p.1. Dosage amount and interval may be adjusted
individually to provide plasma levels of the active moiety which are sufficient to maintain the
desired effects, or minimal effective concentration (MEC). The MEC will vary for each
compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will
depend on individual characteristics and route of administration. However, HPLC assays or
bioassays can be used to determine plasma concentrations.
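
The therapeutic index described above is a simple ratio of LD50 to ED50. As a minimal sketch
of how candidate compounds might be compared on that basis, consider the following; the
compound names and dose values are invented placeholders, not data from this disclosure, and
`therapeutic_index` and `rank_by_index` are hypothetical helper names.

```python
def therapeutic_index(ld50, ed50):
    """Therapeutic index as the ratio LD50 / ED50 (both in the same units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("ld50 and ed50 must be positive")
    return ld50 / ed50


def rank_by_index(compounds):
    """Sort (name, ld50, ed50) tuples by therapeutic index, highest first."""
    return sorted(
        compounds,
        key=lambda c: therapeutic_index(c[1], c[2]),
        reverse=True,
    )


if __name__ == "__main__":
    # Placeholder values in mg/kg, for illustration only.
    candidates = [
        ("compound A", 500.0, 5.0),
        ("compound B", 200.0, 10.0),
        ("compound C", 900.0, 3.0),
    ]
    for name, ld50, ed50 in rank_by_index(candidates):
        print(name, therapeutic_index(ld50, ed50))
```

A higher ratio indicates a wider margin between the effective and the lethal dose, which is the
sense in which high therapeutic indices are preferred.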

Dosage intervals can also be determined using the MEC value. Compounds should be
administered using a regimen which maintains plasma levels above the MEC for 10-90% of the
time, preferably between 30-90% and most preferably between 50-90%. In cases of local
administration or selective uptake, the effective local concentration of the drug may not be
related to plasma concentration.
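
The relationship between dosing interval and time above the MEC can be sketched as follows. This is a minimal illustration that assumes a one-compartment model with first-order elimination and the same peak plasma concentration after every dose; neither assumption comes from this specification, and the function names and numbers are hypothetical.

```python
import math


def fraction_of_interval_above_mec(peak, mec, half_life_h, interval_h):
    """Fraction of one dosing interval during which the plasma level exceeds the MEC.

    Assumes C(t) = peak * exp(-k * t) with k = ln(2) / half-life (an assumption).
    """
    if min(peak, mec, half_life_h, interval_h) <= 0:
        raise ValueError("all inputs must be positive")
    if peak <= mec:
        return 0.0
    k = math.log(2) / half_life_h
    time_above_h = math.log(peak / mec) / k   # time for the level to decay down to the MEC
    return min(1.0, time_above_h / interval_h)


def longest_interval_for_coverage(peak, mec, half_life_h, target_fraction):
    """Longest dosing interval (hours) keeping the level above the MEC for target_fraction of the time."""
    if not 0 < target_fraction <= 1:
        raise ValueError("target_fraction must be in (0, 1]")
    if peak <= mec:
        raise ValueError("peak never exceeds the MEC")
    k = math.log(2) / half_life_h
    return (math.log(peak / mec) / k) / target_fraction


# Hypothetical example: peak 8 units, MEC 1 unit, 4 h half-life, dosing every 24 h.
coverage = fraction_of_interval_above_mec(8.0, 1.0, 4.0, 24.0)
```

With these illustrative numbers the level stays above the MEC for three half-lives (12 h), i.e. half of a 24 h interval, which falls at the lower edge of the most preferred 50-90% range.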

An exemplary dosage regimen for polypeptides or other compositions of the invention
will be in the range of about 0.01 to 100 mg/kg of body weight daily, with the preferred
dose being about 0.1 μg/kg to 25 mg/kg of patient body weight daily, varying in adults and
children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter
intervals.
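
The weight-based arithmetic behind such a regimen can be sketched as below. This is purely illustrative: it assumes the lower preferred bound is expressed in micrograms per kilogram, the 70 kg body weight is a hypothetical example, and the function name is not part of this specification.

```python
def daily_dose_range_mg(weight_kg, low_mg_per_kg, high_mg_per_kg):
    """Convert a per-kilogram daily dose range into absolute milligrams for one patient."""
    if weight_kg <= 0 or low_mg_per_kg < 0 or high_mg_per_kg < low_mg_per_kg:
        raise ValueError("invalid weight or dose range")
    return weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg


# Preferred range from the text, with the lower bound read as 0.1 ug/kg (assumption).
PREFERRED_LOW_MG_PER_KG = 0.1e-3   # 0.1 ug/kg expressed in mg/kg
PREFERRED_HIGH_MG_PER_KG = 25.0    # 25 mg/kg

low_mg, high_mg = daily_dose_range_mg(70.0, PREFERRED_LOW_MG_PER_KG, PREFERRED_HIGH_MG_PER_KG)
```

For the hypothetical 70 kg adult this gives roughly 0.007 mg to 1750 mg per day, which shows how wide the stated preferred range is before any clinical judgment is applied.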

The amount of composition administered will, of course, be dependent on the subject
being treated, on the subject's age and weight, the severity of the affliction, the manner of
administration and the judgment of the prescribing physician.

4.12.4 PACKAGING

The compositions may, if desired, be presented in a pack or dispenser device which may
contain one or more unit dosage forms containing the active ingredient. The pack may, for
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may
be accompanied by instructions for administration. Compositions comprising a compound of the
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an
appropriate container, and labeled for treatment of an indicated condition.

4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the
invention. The term "antibody" as used herein refers to immunoglobulin molecules and
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain
an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab, Fab' and F(ab')2
fragments, and an Fab expression library. In general, an antibody molecule obtained from
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another
by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well,
such as IgG1, IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or
a lambda chain. Reference herein to antibodies includes a reference to all such classes,
subclasses and types of human antibody species.

An isolated related protein of the invention, or a portion or fragment thereof, may be
intended to serve as an antigen, and additionally can be used as an immunogen to generate
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the
invention provides antigenic peptide fragments of the antigen for use as immunogens. An
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence
of the full length protein (for example, the amino acid sequence shown in SEQ ID NO: 1351),
and encompasses an epitope thereof such that an antibody raised against the peptide forms a
specific immune complex with the full length protein or with any fragment that contains the
epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least
15 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues.
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are
located on its surface; commonly these are hydrophilic regions.

In certain embodiments of the invention, at least one epitope encompassed by the
antigenic peptide is a region of the related protein that is located on the surface of the protein,
e.g., a hydrophilic region. A hydrophobicity analysis of the human related protein sequence will
indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely
to encode surface residues useful for targeting antibody production. As a means for targeting
antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity
may be generated by any method well known in the art, including, for example, the Kyte
Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g.,
Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J.
Mol. Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety.
Antibodies that are specific for one or more domains within an antigenic protein, or derivatives,
fragments, analogs or homologs thereof, are also provided herein.
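
A hydropathy profile of the kind referred to above can be sketched in a few lines of Python. The example below uses the Kyte-Doolittle hydropathy scale with a simple sliding-window average and no Fourier transformation. The window length of 7, the function names, and the test sequence are illustrative assumptions, and the sketch is not a substitute for the published methods.

```python
# Kyte-Doolittle hydropathy values (Kyte and Doolittle, 1982); negative = hydrophilic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=7):
    """Mean hydropathy of every window of the given length along the sequence."""
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]  # KeyError on non-standard residues
    return [sum(scores[i:i + window]) / window for i in range(len(seq) - window + 1)]


def most_hydrophilic_window(sequence, window=7):
    """Return (start index, peptide, mean score) of the most hydrophilic window."""
    profile = hydropathy_profile(sequence, window)
    start = min(range(len(profile)), key=profile.__getitem__)
    return start, sequence[start:start + window], profile[start]


# Hypothetical toy sequence: a hydrophobic stretch followed by a lysine-rich stretch.
start, peptide, score = most_hydrophilic_window("IIIIIIIKKKKKKK", window=7)
```

Windows with strongly negative mean scores are candidates for surface-exposed regions, which is the rationale given above for choosing antigenic peptides.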

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog
thereof, may be utilized as an immunogen in the generation of antibodies that
immunospecifically bind these protein components.

Various procedures known within the art may be used for the production of polyclonal or
monoclonal antibodies directed against a protein of the invention, or against derivatives,
fragments, analogs, homologs or orthologs thereof (see, for example, Antibodies: A Laboratory
Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below.

5.13.1 Polyclonal Antibodies

For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit,
goat, mouse or other mammal) may be immunized by one or more injections with the native
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate
immunogenic preparation can contain, for example, the naturally occurring immunogenic
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a
recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to
a second protein known to be immunogenic in the mammal being immunized. Examples of such
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin,
bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an
adjuvant. Various adjuvants used to increase the immunological response include, but are not
limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface
active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions,
dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and
Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A,
synthetic trehalose dicorynomycolate).

The polyclonal antibody molecules directed against the immunogenic protein can be
isolated from the mammal (e.g., from the blood) and further purified by well known techniques,
such as affinity chromatography using protein A or protein G, which provide primarily the IgG
fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the
target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to
purify the immune specific antibody by immunoaffinity chromatography. Purification of
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The
Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28).

5.13.2 Monoclonal Antibodies

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used
herein, refers to a population of antibody molecules that contain only one molecular species of
antibody molecule consisting of a unique light chain gene product and a unique heavy chain
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal
antibody are identical in all the molecules of the population. MAbs thus contain an antigen
binding site capable of immunoreacting with a particular epitope of the antigen characterized by
a unique binding affinity for it.

Monoclonal antibodies can be prepared using hybridoma methods, such as those
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse,
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind
to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro.

The immunizing agent will typically include the protein antigen, a fragment thereof or a
fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal
Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-103). Immortalized cell
lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can
be cultured in a suitable culture medium that preferably contains one or more substances that
inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells.

Preferred immortalized cell lines are those that fuse efficiently, support stable high level
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which
can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego,
California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and
mouse-human heteromyeloma cell lines also have been described for the production of human
monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et al., Monoclonal
Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp.
51-63).

The culture medium in which the hybridoma cells are cultured can then be assayed for
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding
specificity of monoclonal antibodies produced by the hybridoma cells is determined by
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or
enzyme-linked immunosorbent assay (ELISA). Such techniques and assays are known in the
art. The binding affinity of the monoclonal antibody can, for example, be determined by the
Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980). Preferably,
antibodies having a high degree of specificity and a high binding affinity for the target antigen
are isolated.
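
One common way to estimate binding affinity from equilibrium binding data is a linear Scatchard plot, sketched below. This is a simplified illustration assuming a single class of non-interacting binding sites; it is not the Munson and Pollard procedure itself, and the function name and data are hypothetical. The plot uses the relation bound/free = (Bmax - bound)/Kd, so the slope of bound/free against bound is -1/Kd and the x-intercept is Bmax.

```python
def scatchard_fit(bound, free):
    """Least-squares line through (bound, bound/free); returns (Kd, Bmax)."""
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope            # dissociation constant
    bmax = -intercept / slope    # x-intercept: total binding-site concentration
    return kd, bmax


# Hypothetical noise-free data generated from Kd = 2.0 and Bmax = 10.0 (arbitrary units).
free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
kd, bmax = scatchard_fit(bound_conc, free_conc)
```

A smaller fitted Kd corresponds to a higher binding affinity. With real, noisy data a nonlinear fit of the binding isotherm is generally more reliable than this linearization.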

After the desired hybridoma cells are identified, the clones can be subcloned by limiting
dilution procedures and grown by standard methods. Suitable culture media for this purpose
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium.
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal.

The monoclonal antibodies secreted by the subclones can be isolated or purified from the
culture medium or ascites fluid by conventional immunoglobulin purification procedures such
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis,
dialysis, or affinity chromatography.

The monoclonal antibodies can also be made by recombinant DNA methods, such as
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the
invention can be readily isolated and sequenced using conventional procedures (e.g., by using
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred
source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of
monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for
example, by substituting the coding sequence for human heavy and light chain constant domains
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368,
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin
polypeptide can be substituted for the constant domains of an antibody of the invention, or can
be substituted for the variable domains of one antigen-combining site of an antibody of the
invention to create a chimeric bivalent antibody.

5.13.2 Humanized Antibodies

The antibodies directed against the protein antigens of the invention can further comprise
humanized antibodies or human antibodies. These antibodies are suitable for administration to
humans without engendering an immune response by the human against the administered
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins,
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other
antigen-binding subsequences of antibodies) that are principally comprised of the sequence of a human
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin.
Humanization can be performed following the method of Winter and co-workers (Jones et al.,
Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al.,
Science, 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding
non-human residues. Humanized antibodies can also comprise residues which are found neither
in the recipient antibody nor in the imported CDR or framework sequences. In general, the
humanized antibody will comprise substantially all of at least one, and typically two, variable
domains, in which all or substantially all of the CDR regions correspond to those of a non-human
immunoglobulin and all or substantially all of the framework regions are those of a human
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at
least a portion of an immunoglobulin constant region (Fc), typically that of a human
immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol.,
2:593-596 (1992)).

5.13.3 Human Antibodies

Fully human antibodies relate to antibody molecules in which essentially the entire
sequences of both the light chain and the heavy chain, including the CDRs, arise from human
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein.
Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). Human monoclonal
antibodies may be utilized in the practice of the present invention and may be produced by using
human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by
transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In:
MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96).

In addition, human antibodies can also be produced using additional techniques,
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991);
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the
endogenous immunoglobulin genes have been partially or completely inactivated. Upon
challenge, human antibody production is observed, which closely resembles that seen in humans
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach
is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126;
5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et al.
(Nature 368, 856-859 (1994)); Morrison (Nature 368, 812-13 (1994)); Fishwild et al. (Nature
Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and
Lonberg and Huszar (Intern. Rev. Immunol. 13, 65-93 (1995)).

Human antibodies may additionally be produced using transgenic nonhuman animals
which are modified so as to produce fully human antibodies rather than the animal's endogenous
antibodies in response to challenge by an antigen. (See PCT publication WO 94/02602). The
endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins
are inserted into the host's genome. The human genes are incorporated, for example, using yeast
artificial chromosomes containing the requisite human DNA segments. An animal which
provides all the desired modifications is then obtained as progeny by crossbreeding intermediate
transgenic animals containing fewer than the full complement of the modifications. The
preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B
cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from
the animal after immunization with an immunogen of interest, as, for example, a preparation of a
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the
immunoglobulins with human variable regions can be recovered and expressed to obtain the
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for
example, single chain Fv molecules.

An example of a method of producing a nonhuman host, exemplified as a mouse, lacking
expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No.
5,939,598. It can be obtained by a method including deleting the J segment genes from at least
one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the
locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain locus,
the deletion being effected by a targeting vector containing a gene encoding a selectable marker;
and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells
contain the gene encoding the selectable marker.

A method for producing an antibody of interest, such as a human antibody, is disclosed in
U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a
nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing
an expression vector containing a nucleotide sequence encoding a light chain into another
mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an
antibody containing the heavy chain and the light chain.

In a further improvement on this procedure, a method for identifying a clinically relevant
epitope on an immunogen, and a correlative method for selecting an antibody that binds
immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication
WO 99/53049.

5.13.4 Fab Fragments and Single Chain Antibodies

According to the invention, techniques can be adapted for the production of single-chain
antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778).
In addition, methods can be adapted for the construction of Fab expression libraries (see e.g.,
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of
monoclonal Fab fragments with the desired specificity for a protein or derivatives, fragments,
analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen
may be produced by techniques known in the art including, but not limited to: (i) an F(ab')2
fragment produced by pepsin digestion of an antibody molecule; (ii) an Fab fragment generated
by reducing the disulfide bridges of an F(ab')2 fragment; (iii) an Fab fragment generated by the
treatment of the antibody molecule with papain and a reducing agent; and (iv) Fv fragments.

5.13.5 Bispecific Antibodies

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that
have binding specificities for at least two different antigens. In the present case, one of the
binding specificities is for an antigenic protein of the invention. The second binding target is any
other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit.

Methods for making bispecific antibodies are known in the art. Traditionally, the
recombinant production of bispecific antibodies is based on the co-expression of two
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a
potential mixture of ten different antibody molecules, of which only one has the correct
bispecific structure. The purification of the correct molecule is usually accomplished by affinity
chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May
1993, and in Traunecker et al., 1991 EMBO J., 10:3655-3659.
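
The figure of ten possible molecules can be checked by simple enumeration. The Python sketch below models each antibody as an unordered pair of heavy-chain/light-chain arms, which is a simplifying assumption, and uses hypothetical chain labels H1, H2, L1 and L2 that do not appear in this specification.

```python
from itertools import combinations_with_replacement, product

heavy_chains = ["H1", "H2"]   # two heavy chains with different specificities
light_chains = ["L1", "L2"]   # their cognate light chains

# Random assortment: any heavy chain may pair with any light chain to form an arm.
arms = [h + l for h, l in product(heavy_chains, light_chains)]

# An antibody consists of two arms; the order of the arms does not matter.
molecules = list(combinations_with_replacement(arms, 2))

# Only the molecule carrying both cognate pairings has the correct bispecific structure.
correct = [m for m in molecules if set(m) == {"H1L1", "H2L2"}]

print(len(molecules), "possible molecules;", len(correct), "correctly paired bispecific")
```

Four possible arms give 4 + 6 = 10 unordered pairs, of which exactly one combines both cognate heavy/light pairings, consistent with the statement above.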

Antibody variable domains with the desired binding specificities (antibody-antigen
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region
(CH1) containing the site necessary for light-chain binding present in at least one of the fusions.
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin
light chain, are inserted into separate expression vectors, and are co-transfected into a suitable
host organism. For further details of generating bispecific antibodies see, for example, Suresh et
al., Methods in Enzymology, 121:210 (1986).

According to another approach described in WO 96/27011, the interface between a pair
of antibody molecules can be engineered to maximize the percentage of heterodimers which are
recovered from recombinant cell culture. The preferred interface comprises at least a part of the
CH3 region of an antibody constant domain. In this method, one or more small amino acid side
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g.
tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side
chain(s) are created on the interface of the second antibody molecule by replacing large amino
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for
increasing the yield of the heterodimer over other unwanted end-products such as homodimers.

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g.
F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody
fragments have been described in the literature. For example, bispecific antibodies can be
prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure
wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments
generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB
derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific
antibody. The bispecific antibodies produced can be used as agents for the selective
immobilization of enzymes.

Additionally, Fab' fragments can be directly recovered from E. coli and chemically
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe
the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fragment
was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity
of human cytotoxic lymphocytes against human breast tumor targets.

Various techniques for making and isolating bispecific antibody fragments directly from
recombinant cell culture have also been described. For example, bispecific antibodies have been
produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two
different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region
to form monomers and then re-oxidized to form the antibody heterodimers. This method can
also be utilized for the production of antibody homodimers. The "diabody" technology
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an
alternative mechanism for making bispecific antibody fragments. The fragments comprise a
heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) by a linker
which is too short to allow pairing between the two domains on the same chain. Accordingly,
the VH and VL domains of one fragment are forced to pair with the complementary VL and VH
domains of another fragment, thereby forming two antigen-binding sites. Another strategy for
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been
reported. See, Gruber et al., J. Immunol. 152:5368 (1994).

Antibodies with more than two valencies are contemplated. For example, trispecific
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991).

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an
immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on
a leukocyte such as a T-cell receptor molecule (e.g., CD2, CD3, CD28, or B7), or Fc receptors for
IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD16) so as to focus
cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies
can also be used to direct cytotoxic agents to cells which express a particular antigen. These
antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a
radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody
of interest binds the protein antigen described herein and further binds tissue factor (TF).

5.13.6 Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention.
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies
have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089).
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic
protein chemistry, including those involving crosslinking agents. For example, immunotoxins
can be constructed using a disulfide exchange reaction or by forming a thioether bond.
Examples of suitable reagents for this purpose include iminothiolate and
methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980.

5.13.7 Effector Function Engineering

It can be desirable to modify the antibody of the invention with respect to effector function, so as
to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine
residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond
formation in this region. The homodimeric antibody thus generated can have improved
internalization capability and/or increased complement-mediated cell killing and
antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1191-1195 (1992)
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced
anti-tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff
et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that
has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities.
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989).

5.13.8 Immunoconjugates

The invention also pertains to immunoconjugates comprising an antibody conjugated to a
cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of
bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a
radioconjugate).

Chemotherapeutic agents useful in the generation of such immunoconjugates have been
described above. Enzymatically active toxins and fragments thereof that can be used include
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin,
Aleurites fordii proteins, dianthin proteins, Phytolacca americana proteins (PAPI, PAPII, and
PAP-S), Momordica charantia inhibitor, curcin, crotin, Saponaria officinalis inhibitor, gelonin,
mitogellin, restrictocin, phenomycin, enomycin, and the trichothecenes. A variety of
radionuclides are available for the production of radioconjugated antibodies. Examples include

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional
protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP),
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCl),
active esters (such as disuccinimidyl suberate), aldehydes (such as glutaraldehyde), bis-azido
compounds (such as bis(p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as toluene 2,6-diisocyanate),
and bis-active fluorine compounds (such as 1,5-difluoro-2,4-dinitrobenzene). For example, a
ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987).
Carbon-14-labeled 1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid
(MX-DTPA) is an exemplary chelating agent for conjugation of radionuclide to the antibody. See
WO94/11026.

In another embodiment, the antibody can be conjugated to a "receptor" (such as
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is
administered to the patient, followed by removal of unbound conjugate from the circulation
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn
conjugated to a cytotoxic agent.

4.14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can
be recorded on computer readable media. As used herein, "computer readable media" refers to
any medium which can be read and accessed directly by a computer. Such media include, but
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled
artisan can readily appreciate how any of the presently known computer readable mediums can
be used to create a manufacture comprising computer readable medium having recorded thereon
a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for
storing information on computer readable medium. A skilled artisan can readily adopt any of the
presently known methods for recording information on computer readable medium to generate
manufactures comprising the nucleotide sequence information of the present invention.

A variety of data storage structures are available to a skilled artisan for creating a
computer readable medium having recorded thereon a nucleotide sequence of the present
invention. The choice of the data storage structure will generally be based on the means chosen
to access the stored information. In addition, a variety of data processor programs and formats
can be used to store the nucleotide sequence information of the present invention on computer
readable medium. The sequence information can be represented in a word processing text file,
formatted in commercially-available software such as WordPerfect and Microsoft Word, or
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase,
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring
formats (e.g. text file or database) in order to obtain computer readable medium having recorded
thereon the nucleotide sequence information of the present invention.
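
The two storage formats mentioned above (a database application and a plain ASCII text file) can be illustrated with a minimal sketch. It assumes Python's built-in sqlite3 module as a stand-in for the database applications named in the text; the table name, column names, and the FASTA-style text layout are illustrative assumptions, and the records are placeholders rather than sequences of the invention.

```python
import sqlite3

def store_sequences(conn, records):
    """Record (identifier, residues) pairs in a database table.

    The table and column names are hypothetical, chosen for illustration.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sequences ("
        "seq_id TEXT PRIMARY KEY, residues TEXT NOT NULL)"
    )
    conn.executemany("INSERT OR REPLACE INTO sequences VALUES (?, ?)", records)
    conn.commit()

def to_ascii_text(conn):
    """Render the stored sequences as plain ASCII text (FASTA-style layout)."""
    rows = conn.execute("SELECT seq_id, residues FROM sequences ORDER BY seq_id")
    return "".join(f">{seq_id}\n{residues}\n" for seq_id, residues in rows)

if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")  # in-memory database for the demo
    store_sequences(connection, [("seq_2", "GGCC"), ("seq_1", "ATGC")])
    print(to_ascii_text(connection), end="")
```

The same records can be read back either through database queries or from the text rendering, which is the sense in which the choice of storage structure follows the chosen means of access.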

By providing any of the nucleotide sequences SEQ ID NO:1-1350 or a representative
fragment thereof or a nucleotide sequence at least 95% identical to any of the nucleotide
sequences of SEQ ID NO:1-1350 in computer readable form, a skilled artisan can routinely
access the sequence information for a variety of purposes. Computer software is publicly
available which allows a skilled artisan to access sequence information provided in a computer
readable medium. The examples which follow demonstrate how software which implements the
BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp.
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments
and may be useful in producing commercially important proteins such as enzymes used in
fermentation reactions and in the production of commercially useful metabolites.
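
As a rough illustration of what identifying ORFs in a recorded sequence involves, the following sketch scans the three forward reading frames for an ATG start codon followed in frame by a stop codon. It is not the BLAST or BLAZE software cited above; the function name, the minimum-length parameter, and the restriction to the forward strand are simplifying assumptions.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(seq, min_codons=2):
    """Return (start, end, frame) tuples for forward-strand ORFs.

    start is the index of the ATG, end is exclusive and includes the stop
    codon, and min_codons counts codons before the stop (an assumed cutoff).
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None                   # close it and keep scanning
    return orfs

if __name__ == "__main__":
    example = "CCATGAAATGGTAACC"               # toy sequence, not a SEQ ID NO
    for start, end, frame in find_orfs(example):
        print(frame, example[start:end])
```

A practical search would also scan the reverse complement and apply a much larger length cutoff.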

As used herein, "a computer-based system" refers to the hardware means, software
means, and data storage means used to analyze the nucleotide sequence information of the
present invention. The minimum hardware means of the computer-based systems of the present
invention comprises a central processing unit (CPU), input means, output means, and data
storage means. A skilled artisan can readily appreciate that any one of the currently available
computer-based systems is suitable for use in the present invention. As stated above, the
computer-based systems of the present invention comprise a data storage means having stored
therein a nucleotide sequence of the present invention and the necessary hardware means and
software means for supporting and implementing a search means. As used herein, "data storage
means" refers to memory which can store nucleotide sequence information of the present
invention, or a memory access means which can access manufactures having recorded thereon
the nucleotide sequence information of the present invention.

As used herein, "search means" refers to one or more programs which are implemented
on the computer-based system to compare a target sequence or target structural motif with the
sequence information stored within the data storage means. Search means are used to identify
fragments or regions of a known sequence which match a particular target sequence or target
motif. A variety of known algorithms are disclosed publicly and a variety of commercially
available software for conducting search means is available and can be used in the computer-based
systems of the present invention. Examples of such software include, but are not limited to,
Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A
skilled artisan can readily recognize that any one of the available algorithms or implementing
software packages for conducting homology searches can be adapted for use in the present
computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can
readily recognize that the longer a target sequence is, the less likely a target sequence will be
present as a random occurrence in the database. The most preferred sequence length of a target
sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide
residues. However, it is well recognized that searches for commercially important fragments,
such as sequence fragments involved in gene expression and protein processing, may be of
shorter length.
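
The relationship between target length and chance occurrence can be made concrete with a back-of-the-envelope estimate. The sketch below assumes that residues are independent and equally frequent, which real sequences are not, so the figures are only indicative; the function name and database sizes are illustrative.

```python
def expected_random_hits(target_len, database_len, alphabet_size=4):
    """Expected number of exact chance matches to one target in a database.

    Assumes independent, equiprobable residues: each of the
    (database_len - target_len + 1) windows matches with probability
    (1 / alphabet_size) ** target_len.
    """
    windows = max(database_len - target_len + 1, 0)
    return windows * (1.0 / alphabet_size) ** target_len

if __name__ == "__main__":
    # A six-nucleotide target is expected about once every 4**6 = 4096 positions.
    print(expected_random_hits(6, 4096 + 5))
    # A thirty-nucleotide target is very unlikely to occur by chance,
    # even in a database of a billion residues.
    print(expected_random_hits(30, 10**9))
```

Under this simplified model each added nucleotide lowers the chance-match expectation fourfold, and each added amino acid (alphabet_size=20) lowers it twentyfold.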

As used herein, "a target structural motif," or "target motif," refers to any rationally
selected sequence or combination of sequences in which the sequence(s) are chosen based on a
three-dimensional configuration which is formed upon the folding of the target motif. There are
a variety of target motifs known in the art. Protein target motifs include, but are not limited to,
enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited
to, promoter sequences, hairpin structures and inducible expression elements (protein binding
sequences).

4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to
control gene expression through triple helix formation or antisense DNA or RNA, both of which
methods are based on the binding of a polynucleotide sequence to DNA or RNA.
Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are
designed to be complementary to a region of the gene involved in transcription (triple helix - see
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan
et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem.
56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press,
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA
molecule into polypeptide. Both techniques have been demonstrated to be effective in model
systems. Information contained in the sequences of the present invention is necessary for the
design of an antisense or triple helix oligonucleotide.
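
The dependence of antisense design on sequence information can be shown with a short sketch that derives a DNA oligonucleotide complementary to a chosen region of a target mRNA. The function name and the example target are hypothetical; only the 20 to 40 base length range is taken from the text, and no claim is made about which region of a real transcript would be effective.

```python
# Complement table for DNA or RNA input; the output is written as DNA.
_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")

def antisense_oligo(target, start, length=20):
    """Return the DNA oligonucleotide (5'->3') complementary to
    target[start:start + length]."""
    if not 20 <= length <= 40:
        raise ValueError("length should be 20 to 40 bases")
    region = target.upper()[start:start + length]
    if len(region) < length:
        raise ValueError("target region is shorter than the requested length")
    # Complement each base, then reverse so the result reads 5'->3'.
    return region.translate(_COMPLEMENT)[::-1]

if __name__ == "__main__":
    mrna = "AUGGCCAAGCUUGGAUCCGAAUUC"   # toy sequence, not a SEQ ID NO
    print(antisense_oligo(mrna, 0, 20))
```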

4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated
with a suitable label.

In general, methods for detecting a polynucleotide of the invention can comprise
contacting a sample with a compound that binds to and forms a complex with the polynucleotide
for a period sufficient to form the complex, and detecting the complex, so that if a complex is
detected, a polynucleotide of the invention is detected in the sample. Such methods can also
comprise contacting a sample under stringent hybridization conditions with nucleic acid primers
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is
detected in the sample.

In general, methods for detecting a polypeptide of the invention can comprise contacting
a sample with a compound that binds to and forms a complex with the polypeptide for a period
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a
polypeptide of the invention is detected in the sample.

In detail, such methods comprise incubating a test sample with one or more of the
antibodies or one or more of the nucleic acid probes of the present invention and assaying for
binding of the nucleic acid probes or antibodies to components within the test sample.

Conditions for incubating a nucleic acid probe or antibody with a test sample vary.
Incubation conditions depend on the format employed in the assay, the detection methods
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One
skilled in the art will recognize that any one of the commonly available hybridization,
amplification or immunological assay formats can readily be adapted to employ the nucleic acid
probes or antibodies of the present invention. Examples of such assays can be found in Chard,
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers,
Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry,
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice
and Theory of Immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology,
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the
present invention include cells, protein or membrane extracts of cells, or biological fluids such as
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method
will vary based on the assay format, nature of the detection method and the tissues, cells or
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane
extracts of cells are well known in the art and can readily be adapted in order to obtain a
sample which is compatible with the system utilized.

In another embodiment of the present invention, kits are provided which contain the
necessary reagents to carry out the assays of the present invention. Specifically, the invention
provides a compartment kit to receive, in close confinement, one or more containers which
comprises: (a) a first container comprising one of the probes or antibodies of the present
invention; and (b) one or more other containers comprising one or more of the following: wash
reagents, reagents capable of detecting presence of a bound probe or antibody.

In detail, a compartment kit includes any kit in which reagents are contained in separate
containers. Such containers include small glass containers, plastic containers or strips of plastic
or paper. Such containers allow one to efficiently transfer reagents from one compartment to
another compartment such that the samples and reagents are not cross-contaminated, and the
agents or solutions of each container can be added in a quantitative fashion from one
compartment to another. Such containers will include a container which will accept the test
sample, a container which contains the antibodies used in the assay, containers which contain
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which
contain the reagents used to detect the bound antibody or probe. Types of detection reagents
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed
probes and antibodies of the present invention can be readily incorporated into one of the
established kit formats which are well known in the art.

4.17 MEDICAL IMAGING

The novel polypeptides and binding partners of the invention are useful in medical
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the
invention is involved in the immune response, for imaging sites of inflammation or infection).
See, e.g., Kunkel et al., U.S. Pat. No. 5,413,778. Such methods involve chemical attachment of
a labeling or imaging agent, administration of the labeled polypeptide to a subject in a
pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target
site.

4.18 SCREENING ASSAYS 
Using the isolated proteins and polynucleotides of the invention, the present invention
further provides methods of obtaining and identifying agents which bind to a polypeptide
encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID
NO:1-1350, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In
detail, said method comprises the steps of:

(a) contacting an agent with an isolated protein encoded by an ORF of the present
invention, or nucleic acid of the invention; and

(b) determining whether the agent binds to said protein or said nucleic acid.

In general, therefore, such methods for identifying compounds that bind to a
polynucleotide of the invention can comprise contacting a compound with a polynucleotide of
the invention for a time sufficient to form a polynucleotide/compound complex, and detecting
the complex, so that if a polynucleotide/compound complex is detected, a compound that binds
to a polynucleotide of the invention is identified.

Likewise, in general, therefore, such methods for identifying compounds that bind to a
polypeptide of the invention can comprise contacting a compound with a polypeptide of the
invention for a time sufficient to form a polypeptide/compound complex, and detecting the
complex, so that if a polypeptide/compound complex is detected, a compound that binds to a
polypeptide of the invention is identified.

Methods for identifying compounds that bind to a polypeptide of the invention can also
comprise contacting a compound with a polypeptide of the invention in a cell for a time
sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a
reporter gene sequence in the cell, and detecting the complex by detecting reporter gene
sequence expression, so that if a polypeptide/compound complex is detected, a compound that
binds a polypeptide of the invention is identified.

Compounds identified via such methods can include compounds which modulate the
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to
activity observed in the absence of the compound). Alternatively, compounds identified via such
methods can include compounds which modulate the expression of a polynucleotide of the
invention (that is, increase or decrease expression relative to expression levels observed in the
absence of the compound). Compounds, such as compounds identified via the methods of the
invention, can be tested using standard assays well known to those of skill in the art for their
ability to modulate activity/expression.

The agents screened in the above assay can be, but are not limited to, peptides,
carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected
and screened at random or rationally selected or designed using protein modeling techniques.

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and
the like are selected at random and are assayed for their ability to bind to the protein encoded by
the ORF of the present invention. Alternatively, agents may be rationally selected or designed.
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen
based on the configuration of the particular protein. For example, one skilled in the art can
readily adapt currently available procedures to generate peptides, pharmaceutical agents and the
like, capable of binding to a specific peptide sequence, in order to generate rationally designed
antipeptide peptides, for example see Hurby et al., "Application of Synthetic Peptides: Antisense
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and
Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like.

In addition to the foregoing, one class of agents of the present invention, as broadly
described, can be used to control gene expression through binding to one of the ORFs or EMFs
of the present invention. As described above, such agents can be randomly screened or
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design
sequence specific or element specific agents, modulating the expression of either a single ORF or
multiple ORFs which rely on the same EMF for expression control. One class of DNA binding
agents are agents which contain base residues which hybridize or form a triple helix formation
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester,
ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have
base attachment capacity.

Agents suitable for use in these methods preferably contain 20 to 40 bases and are
designed to be complementary to a region of the gene involved in transcription (triple helix - see
Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription
from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into
polypeptide. Both techniques have been demonstrated to be effective in model systems.
Information contained in the sequences of the present invention is necessary for the design of an
antisense or triple helix oligonucleotide and other DNA binding agents.

Agents which bind to a protein encoded by one of the ORFs of the present invention can
be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the
present invention can be formulated using known techniques to generate a pharmaceutical
composition.

4.19 USE OF NUCLEIC ACIDS AS PROBES

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid
hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The
hybridization probes of the subject invention may be derived from any of the nucleotide
sequences SEQ ID NO:1-1350. Because the corresponding gene is only expressed in a limited
number of tissues, a hybridization probe derived from any of the nucleotide sequences SEQ
ID NO:1-1350 can be used as an indicator of the presence of RNA of a cell type of such a tissue
in a sample.

Any suitable hybridization technique can be employed, such as, for example, in situ
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides
additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a
degenerate pool of possible sequences for identification of closely related genomic sequences.
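
A degenerate pool can be pictured as the set of discrete sequences encoded by a probe written with IUPAC ambiguity codes. The sketch below enumerates that set; the IUPAC table is standard, while the function name and the example probe are illustrative assumptions.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def expand_degenerate(probe):
    """List every discrete sequence represented by a degenerate probe."""
    choices = (IUPAC[base] for base in probe.upper())
    return ["".join(combo) for combo in product(*choices)]

if __name__ == "__main__":
    pool = expand_degenerate("ATGRCNTAY")   # hypothetical probe
    print(len(pool), pool[:4])
```

The pool size is the product of the degeneracies at each position, so it grows quickly as ambiguous positions are added.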

Other means for producing specific hybridization probes for nucleic acids include the
cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors
are known in the art and are commercially available and may be used to synthesize RNA probes
in vitro by means of the addition of the appropriate RNA polymerase such as T7 or SP6 RNA
polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may
be used to construct hybridization probes for mapping their respective genomic sequences. The
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a
chromosome using well known genetic and/or chromosomal mapping techniques. These
techniques include in situ hybridization, linkage analysis against known chromosomal markers,
hybridization screening with libraries or flow-sorted chromosomal preparations specific to
known chromosomes, and the like. The technique of fluorescent in situ hybridization of
chromosome spreads has been described, among other places, in Verma et al (1988) Human
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY.

Fluorescent in situ hybridization of chromosomal preparations and other physical
chromosome mapping techniques may be correlated with additional genetic map data. Examples
of genetic map data can be found in the 1994 Genome Issue of Science (265:1981f). Correlation
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or
predisposition to a specific disease) may help delimit the region of DNA associated with that
genetic disease. The nucleotide sequences of the subject invention may be used to detect
differences in gene sequences between normal, carrier or affected individuals.

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES

Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for
example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced
using an automated oligonucleotide synthesizer.

Support bound oligonucleotides may be prepared by any of the methods known to those of
skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to
precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be
achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol. 28(6) 1469-72);
using UV light (Nagata et al., 1985; Dahlen et al., 1987; Morrissey & Collins, (1989) Mol. Cell
Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller et al., 1988; 1989); all
references being specifically incorporated herein.

Another strategy that may be employed is the use of the strong biotin-streptavidin
interaction as a linker. For example, Broude et al. (1994) Proc. Natl. Acad. Sci. USA 91(8) 3072-6,
describe the use of biotinylated probes, although these are duplex probes, that are immobilized on
streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal,
Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin.
Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies
(Alameda, CA).

Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc
Laboratories have developed a method by which DNA can be covalently bound to the microwell
surface termed CovaLink NH. CovaLink NH is a polystyrene surface grafted with secondary amino
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be
purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the
5'-end by a phosphoramidate bond, allowing immobilization of more than 1 pmol of DNA
(Rasmussen et al., (1991) Anal. Biochem. 198(1) 138-42).

The use of CovaLink NH strips for covalent binding of DNA molecules at the 5'-end has
been described (Rasmussen et al., (1991)). In this technology, a phosphoramidate bond is employed
(Chu et al., (1983) Nucleic Acids Res. 11(8) 6513-29). This is beneficial as immobilization using
only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the
CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to
CovaLink NH via a phosphoramidate bond, the oligonucleotide terminus must have a 5'-end
phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and
then streptavidin used to bind the probes.

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1-methylimidazole,
pH 7.0 (1-MeIm7), is then added to a final concentration of 10 mM 1-MeIm7. A ss DNA solution is
then dispensed into CovaLink NH strips (75 ul/well) standing on ice.

Carbodiimide 0.2 M 1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in
10 mM 1-MeIm7, is made fresh and 25 ul added per well. The strips are incubated for 5 hours at
50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells are
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed
3 times (wherein the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C).
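
The dilution arithmetic implied by this protocol can be checked with a short calculation. The sketch assumes simple additive volumes; the function names and the 90 ul sample volume are illustrative, while the stock concentrations and the 75 ul and 25 ul well volumes come from the text.

```python
def final_concentration(stock_conc, stock_vol, other_vol):
    """Concentration after mixing stock_vol of stock with other_vol of a
    solution that contains none of the solute (volumes assumed additive)."""
    return stock_conc * stock_vol / (stock_vol + other_vol)

def stock_volume_for_target(sample_vol, stock_conc, target_conc):
    """Volume of stock to add to sample_vol to reach target_conc."""
    return sample_vol * target_conc / (stock_conc - target_conc)

if __name__ == "__main__":
    # 25 ul of 0.2 M EDC added to 75 ul of DNA solution in the well.
    print(final_concentration(0.2, 25.0, 75.0))       # molar, about 0.05
    # Volume of 0.1 M 1-MeIm7 needed to bring a hypothetical 90 ul sample
    # to 10 mM: a one-in-ten dilution of the stock.
    print(stock_volume_for_target(90.0, 0.1, 0.01))   # ul, about 10
```

Under these assumptions the EDC is present at roughly 50 mM in each well during the incubation.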

It is contemplated that a further suitable method for use with the present invention is that
described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by
reference. This method of preparing an oligonucleotide bound to a support involves attaching a
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic
hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported
nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include
nucleoside phosphoramidite and nucleoside hydrogen phosphorate.

An on-chip strategy for the preparation of DNA probe arrays may be employed. For
example, addressable laser-activated photodeprotection may be employed in the chemical
synthesis of oligonucleotides directly on a glass surface, as described by Fodor et al. (1991)
Science 251(4995) 767-73, incorporated herein by reference. Probes may also be immobilized on
nylon supports as described by Van Ness et al. (1991) Nucleic Acids Res. 19(12) 3345-50; or
linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 169(1) 104-8; all
references being specifically incorporated herein.

To link an oligonucleotide to a nylon support, as described by Van Ness et al. (1991),
requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of
oligonucleotides with cyanuric chloride.

One particular way to prepare support bound oligonucleotides is to utilize the
light-generated synthesis described by Pease et al., (1994) PNAS USA 91(11) 5022-6, incorporated
herein by reference. These authors used current photolithographic techniques to generate arrays of
immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct
the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile
5'-protected N-acyl-deoxynucleoside phosphoramidites, surface linker chemistry and versatile
combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be
generated in this manner.

4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA,
including mRNA without any amplification steps. For example, Sambrook et al. (1989) describes
three protocols for the isolation of high molecular weight DNA from mammalian cells (p.
9.14-9.23).

DNA fragments may be prepared as clones in M13, plasmid or lambda vectors and/or
prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples
may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA samples may be
prepared in 2-500 ml of final volume.

The nucleic acids would then be fragmented by any of the methods known to those of skill
in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook
et al. (1989), shearing by ultrasound and NaOH treatment.

Low pressure shearing is also appropriate, as described by Schriefer et al. (1990) Nucleic
Acids Res. 18(24) 7455-6, incorporated herein by reference. In this method, DNA samples are
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever
device allows controlled application of low to intermediate pressures to the cell. The results of
these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA
fragmentation methods.

One particularly suitable way for fragmenting DNA is contemplated to be that using the two
base recognition endonuclease, CviJI, described by Fitzgerald et al. (1992) Nucleic Acids Res.
20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation
of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and
sequencing.

The restriction endonuclease CviJI normally cleaves the recognition sequence PuGCPy
between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of
this enzyme (CviJI**), yield a quasi-random distribution of DNA fragments from the small
molecule pUC19 (2688 base pairs). Fitzgerald et al. (1992) quantitatively evaluated the
randomness of this fragmentation strategy, using a CviJI** digest of pUC19 that was size
fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus
M13 cloning vector. Sequence analysis of 76 clones showed that CviJI** restricts PyGCPy and
PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate
consistent with random fragmentation.
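
The site specificities described above can be illustrated with a short sketch. It is only an illustration under stated assumptions: the function names are hypothetical, the sequence is treated as linear and single-stranded text, and the relaxed (CviJI**) mode is modeled simply as cutting at PuGCPy, PyGCPy and PuGCPu sites (but not PyGCPu), with the blunt cut placed between the G and the C.

```python
import re

# Pu = purine (A or G), Py = pyrimidine (C or T).
# Lookahead groups are used so that overlapping sites are all reported.
NORMAL_SITE = re.compile(r"(?=([AG]GC[CT]))")                  # PuGCPy
RELAXED_SITE = re.compile(r"(?=([AG]GC[ACGT]|[CT]GC[CT]))")    # PuGCPy, PuGCPu, PyGCPy


def cut_positions(seq, relaxed=False):
    """Return 0-based cut offsets (number of bases left of each cut)."""
    pattern = RELAXED_SITE if relaxed else NORMAL_SITE
    # The enzyme cuts between G and C, i.e. two bases into the 4-base site.
    return [m.start() + 2 for m in pattern.finditer(seq.upper())]


def fragments(seq, relaxed=False):
    """Split a linear sequence at every cut position."""
    bounds = [0] + cut_positions(seq, relaxed) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    demo = "TTAGCTAACGCCTTAGCAGG"   # hypothetical test sequence
    print(fragments(demo))                 # normal specificity
    print(fragments(demo, relaxed=True))   # relaxed (CviJI**) specificity
```

In the toy sequence the relaxed mode produces more, shorter fragments than the normal mode, which is the qualitative behavior the passage attributes to CviJI** digestion.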

As reported in the literature, advantages of this approach compared to sonication and
agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 µg instead of 2-5
µg); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel
electrophoresis and elution are needed).

Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is
important to denature the DNA to give single stranded pieces available for hybridization. This is
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the
chip. Phosphate groups must also be removed from genomic DNA by methods known in the art.

4.22 PREPARATION OF DNA ARRAYS

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane.
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an
array of wells in a microtiter plate) to repeatedly transfer about 20 nl of a DNA solution to a
nylon membrane. By offset printing, a density of dots higher than the density of the wells is
achieved. One to 25 dots may be accommodated in 1 mm², depending on the type of label used. By
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays)
may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same
gene) from different individuals, or may be different, overlapped genomic clones. Each of the
subarrays may represent replica spotting of the same samples. In one example, a selected gene
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in
one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane.
Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the
dot span may be 1 mm² and there may be a 1 mm space between subarrays.
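
The 64-patient example can be sketched as a coordinate calculation. This is a minimal sketch under assumptions that are not stated in the text: a 9 mm pin-to-pin pitch (the usual spacing of a 96-well plate), a dot cell of 1 mm x 1 mm, and an 8 x 8 arrangement of the 64 patient dots inside each subarray. The function name and constants are hypothetical.

```python
from typing import Tuple

PIN_PITCH_MM = 9.0   # assumption: standard 96-well spacing, not given in the text
DOT_PITCH_MM = 1.0   # assumption: the 1 mm dot span is treated as a 1 mm x 1 mm cell
PIN_ROWS, PIN_COLS = 8, 12   # 96-pin device
SUB_SIDE = 8                 # assumption: 8 x 8 = 64 dots per subarray


def dot_position(patient: int, pin_row: int, pin_col: int) -> Tuple[float, float]:
    """Return (x, y) in mm of the dot left by one pin when printing one patient plate.

    Each patient plate is printed with the whole pin head shifted by a small
    offset (offset printing), so every pin builds up its own 64-dot subarray.
    """
    if not (0 <= patient < SUB_SIDE * SUB_SIDE):
        raise ValueError("patient index out of range")
    if not (0 <= pin_row < PIN_ROWS and 0 <= pin_col < PIN_COLS):
        raise ValueError("pin index out of range")
    off_row, off_col = divmod(patient, SUB_SIDE)
    x = pin_col * PIN_PITCH_MM + off_col * DOT_PITCH_MM
    y = pin_row * PIN_PITCH_MM + off_row * DOT_PITCH_MM
    return (x, y)


if __name__ == "__main__":
    print(dot_position(0, 0, 0))     # first patient, first pin
    print(dot_position(63, 7, 11))   # last patient, last pin
```

Under these assumptions each subarray occupies 8 mm of the 9 mm pin pitch, which leaves the 1 mm space between subarrays mentioned above, and the full pattern (about 108 x 72 mm) fits on the 8 x 12 cm membrane.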

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois)
which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage
screens or x-ray films.

The present invention is illustrated in the following examples. Upon consideration of the
present disclosure, one of skill in the art will appreciate that many other embodiments and variations
may be made in the scope of the present invention. Accordingly, it is intended that the broader
aspects of the present invention not be limited to the disclosure of the following examples. The
present invention is not to be limited in scope by the exemplified embodiments which are intended
as illustrations of single aspects of the invention, and compositions and methods which are
functionally equivalent are within the scope of the invention. Indeed, numerous modifications and
variations in the practice of the invention are expected to occur to those skilled in the art upon
consideration of the present preferred embodiments. Consequently, the only limitations which
should be placed upon the scope of the invention are those which appear in the appended claims.

All references cited within the body of the instant specification are hereby incorporated by
reference in their entirety.

5.0 EXAMPLES

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various
human tissues and in some cases isolated from a genomic library derived from human chromosome
using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The
inserts of the library were amplified with PCR using primers specific for the vector sequences
which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and
screened with oligonucleotide probes (e.g., 7-mers) to obtain signature sequences. The clones were
clustered into groups of similar or identical sequences. Representative clones were selected for
sequencing.
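
The idea of grouping clones by their probe signatures can be sketched as follows. This is a simplified illustration, not the procedure actually used: real signatures come from measured hybridization intensities, whereas here a signature is idealized as the set of probes with a perfect match in a known clone sequence. The greedy single-pass grouping, the Jaccard similarity measure and the 0.8 threshold are all assumptions introduced for the example.

```python
from typing import Dict, FrozenSet, List


def signature(clone_seq: str, probes: List[str]) -> FrozenSet[str]:
    """Idealized signature: the probes that occur exactly in the clone sequence."""
    return frozenset(p for p in probes if p in clone_seq)


def jaccard(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Shared probes divided by all probes seen in either signature."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def cluster(signatures: Dict[str, FrozenSet[str]], threshold: float = 0.8) -> List[List[str]]:
    """Greedy clustering: a clone joins the first cluster whose first member is similar enough."""
    clusters: List[List[str]] = []
    for name, sig in signatures.items():
        for members in clusters:
            if jaccard(sig, signatures[members[0]]) >= threshold:
                members.append(name)
                break
        else:
            clusters.append([name])   # no similar cluster found, start a new one
    return clusters


if __name__ == "__main__":
    probes = ["ACGTACG", "TTTTGGG", "CCCAAAT", "GGATCCA"]   # hypothetical 7-mers
    clones = {
        "c1": "AAACGTACGTTTTGGGAA",
        "c2": "CCAAACGTACGTTTTGGGAATT",
        "c3": "GGATCCACCCAAATGG",
    }
    sigs = {name: signature(seq, probes) for name, seq in clones.items()}
    print(cluster(sigs))
```

One member of each resulting group would then stand in as the representative clone selected for sequencing.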

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems
(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Rapid
Amplification of cDNA Ends) was performed to further extend the sequence in the 5' direction.

5.2 EXAMPLE 2

Novel Contigs

The novel contigs of the invention were assembled from sequences that were obtained from
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained
from one or more public databases. The sequences for the resulting nucleic acid contigs are
designated as SEQ ID NO: 1-1350 and are provided in the attached Sequence Listing. The contigs
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend
the seed EST into an extended assemblage, by pulling additional sequences from different databases
(i.e., Hyseq's database containing EST sequences, dbEST version 114, gb pri 114, and UniGene
version 101) that belong to this assemblage. The algorithm terminated when there were no
additional sequences from the above databases that would extend the assemblage. Inclusion of
component sequences into the assemblage was based on a BLASTN hit to the extending
assemblage with BLAST score greater than 300 and percent identity greater than 95%.
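
The seed-extension procedure can be sketched as a loop that runs until no new sequence qualifies. This is a hedged sketch, not the program used for the contigs: the `search` callable is a hypothetical stand-in for a BLASTN search of the listed databases, the recursion is written as an equivalent iteration, and the assemblage is reduced to a set of sequence identifiers rather than a true consensus assembly. Only the two thresholds (score above 300, identity above 95%) are taken from the text.

```python
from typing import Callable, Iterable, NamedTuple, Set


class Hit(NamedTuple):
    seq_id: str
    score: float      # BLAST score of the hit against the current assemblage
    identity: float   # percent identity of the hit


MIN_SCORE = 300.0      # "BLAST score greater than 300"
MIN_IDENTITY = 95.0    # "percent identity greater than 95%"


def extend_assemblage(seed: str, search: Callable[[Set[str]], Iterable[Hit]]) -> Set[str]:
    """Grow an assemblage from a seed EST until no further sequence extends it."""
    members = {seed}
    while True:
        # Keep only hits that pass both thresholds and are not yet members.
        new = {
            h.seq_id
            for h in search(members)
            if h.score > MIN_SCORE and h.identity > MIN_IDENTITY
        } - members
        if not new:
            return members    # termination: nothing extends the assemblage
        members |= new


if __name__ == "__main__":
    # Hypothetical hit table standing in for database searches.
    toy_hits = {
        "seed": [Hit("est2", 450, 98.0), Hit("est3", 280, 99.0)],
        "est2": [Hit("est4", 320, 96.5), Hit("est5", 900, 94.0)],
        "est4": [Hit("seed", 500, 100.0)],
    }
    print(sorted(extend_assemblage("seed", lambda ms: [h for m in ms for h in toy_hits.get(m, [])])))
```

In the toy run, est3 is rejected on score and est5 on identity, so the assemblage stops growing after est2 and est4 have been added.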

Table 3 sets forth the novel predicted polypeptides (including proteins) encoded by the
novel polynucleotides (SEQ ID NO: 189-282) of the present invention, and their corresponding
nucleotide locations to each of SEQ ID NO: 189-282. Table 3 also indicates the method by which
the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software
program called FASTY (available from http://fasta.bioch.virginia.edu) which selects a polypeptide
based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R.
Pearson, Methods in Enzymology, 183:63-98 (1990), herein incorporated by reference). Method B
refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate
sequences (available from Stanford University, Office of Technology Licensing) that predicts the
polypeptide based on a probabilistic model of gene structure/compositional properties (C. Burge
and S. Karlin, J. Mol. Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers
to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel
polynucleotide and its complementary strand into six possible amino acid sequences (forward and
reverse frames) and chooses the polypeptide with the longest open reading frame.
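
The general idea behind Method C (six-frame translation followed by selection of the longest open reading frame) can be sketched as below. The Hyseq program itself is proprietary and is not reproduced here; this sketch assumes the standard genetic code and defines an open reading frame, for illustration only, as a stretch that begins at a methionine codon and runs to the next stop codon or to the end of the sequence. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered T, C, A, G at each position; '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3))


def longest_orf(dna: str) -> str:
    """Return the longest M-initiated peptide found in any of the six reading frames."""
    dna = dna.upper()
    best = ""
    for strand in (dna, reverse_complement(dna)):
        for frame in range(3):
            for segment in translate(strand[frame:]).split("*"):
                start = segment.find("M")
                if start != -1 and len(segment) - start > len(best):
                    best = segment[start:]
    return best


if __name__ == "__main__":
    print(longest_orf("CCATGGCTAAATAGCC"))   # hypothetical toy sequence
```

A production tool would also report the frame and the start and stop nucleotide positions, which is the kind of information Table 3 is described as listing.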

The nearest neighbor results for SEQ ID NO: 1-1350 were obtained by a BLASTP
version 2.0a19MP-WashU search against Genpept release 120 and Geneseq database October
12, 2000, update 21 (Derwent), using BLAST algorithm. The nearest neighbor result showed the
closest homologue for SEQ ID NO: 1-1350. The nearest neighbor results for SEQ ID NO: 1-1350
are shown in Table 2 below.

Tables 1, 2 and 3 follow. Table 1 shows the various tissue sources of SEQ ID NO: 1-1350.
Table 2 shows the nearest neighbor result for the assembled contig. The nearest neighbor result
shows the closest homolog with an identifiable function for each assemblage. Table 3 contains the
start and stop nucleotides for the translated amino acid sequence for which each assemblage
encodes. Table 3 also provides a correlation between the amino acid sequences set forth in the
Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID NO. in
USSN 09/496,914.






WO 01/57188



PCT/US01/03800



TABLE 1 



Tissue Origin


RNA Source


Hyseq Library Name


SEQ ID NOS:


adult brain 


GIBCO




111 1S1 IKK 71^ X77 QinQ77 

976 1233 1319 


adult brain


GIBCO 


ABD003 


41 49 74 101 111 120 132 141-142 151 
217 225 238 271 317 404 446 469 503 
513-514 535 550 564 573 666-669 798 
898 910 927 976 1067 1083 1085 1178 
1254 


adult brain 


Clontech


ABR001


39 216 238 327 356 535 927 1056 1121
1178-1180 1199 1251


adult brain 


Clontech


ABR006


74 611 949 1034 1136


adult brain 


Clontech


ABR008 


14 32 41 61 81 86 89 120 132 138 145 
147 188 197 208 225 227-239 250 300-
303 312 316 328-331 340 357-362 374
380 384-391 408 414 446 448 464-467
483 488 495-496 505 512 521 535 550
566 571 577 585 590 594 598 634 641
658 666 683 725 742 764 767 786 801 
805 810 823 826 829 831 836 841 887- 
923 927 934 943 950-951 963 976 995 
1000-1001 1006 1026 1034 1048 1057- 
1067 1086 1088 1090 1118 1120 1122- 
1128 1142 1162 1181-1192 1199 1204 
1218-1219 1225 1232 1253 1267 1271- 
1306 1342 1347 1349-1350 


adult brain 


Clontech


ABR011


49 238 1219 


adult brain 


BioChain


ABR012


74 238


adult brain 


Invitrogen


ABR013 


868 1268 


adult brain 


Invitrogen


ABT004


49 117 138 191 217 252 291 305 535 
566 596 663 670 746 798 816-819 876 
892 898 922 943 963 1034-1036 1121


cultured 
preadipocytes 


Stratagene


ADP001


41 74 101 138 211 238 304 537 582
740 798 883 943 976 1067 


adrenal gland 


Clontech


ADR002


49 74 101 111 120 127 151 215 238
240-247 316 330 363-364 404 414 534- 
535 833 924-940 950 963 976 1001 
1003 1067-1070 1118 1156 1193-1200 
1325 


adult heart 


GIBCO


AHR001


38 49 71-72 74-77 79 92 99 101 111
118 129 132 138 151 158-163 182 195- 
203 215 217 238 264 269 353 384 398 
408 434-439 446 504 512-513 519 537 
562-573 577 611-614 616-619 658 661 
671-672 722 734 757-773 815 828-835 
874 891 898 919 926-927 976 988 
1021 1037 1041 1062 1067 1071 1080 

1U8J Iwi 11Z7 1131 1183 l^Ul 1.Z04 

1308 1331 1335 


adult kidney 


GIBCO 


AKD001


41 49 51 71-74 78-85 94 100-101 103-
107 111 119-120 138 151 157 215 217-

21 8 238 IfiA 2Q4 30d 384 404 440 

446 454 477 504-505 509 514 518-519 

535 537 564 574-583 620-627 639 653 

673-675 705 753 789 831 844 851 859 

877 909 918 927 956 963 976 1067 

1074 1083 1095 1178 1302 1331 1335 


adult kidney 


Invitrogen


AKT002


11-12 41 49 111-112 215-217 294 316
446 487 564 575 844 868 910 927 976
1116 


adult lung


GIBCO


ALG001


8 101 111 151 187 402 446 490 514











518 537 545 549 580 582 592 594 634 
640 651-652 676-678 725 851 873 918
952 976 1042 1067 1076 1083 1152 


lymph node 


Clontech


ALN001


8 111 121 151 180-182 188 215 537
545 549 651 679-682 789 804-810 868
873 927 952 976 1042 1059 1335 


young liver


GIBCO


ALV001


8 64 79 111 186 215-216 238 446 514
519 537 564 653 683-684 698 753 798 
813 833 840 858 927 976 1038-1039 
1051 1085 1224 1245 1256 


adult liver 


Invitrogen 


ALV002 


40 71 292-293 305 384 46*469 496 
505 657 675 714 753 832 844 941-942 
976 1040 1076 1256 1293 


adult liver 


Clontech


ALV003 


976 


adult ovary 


Invitrogen


AOV001


8 32 36 38 41 49 51 71 74 79-80 101
104 111 120 122-125 138 140 143-149
151 188-190 207-212 215-217 238 264
316 384 409 440 445-446 496 504 512
514 518-519 535 537 549-550 564 566 
571 580 582 600 618 638 657 667 681 
685-697 699 705 722 735-744 761 771 
815 833 842-865 868 875-876 918 926- 
927 950 952 963 976 1023 1042 1048 
1051 1059 1072 1076 1083 1117 1120 
1124 1131 1144 1174 1224 1268 1331 
1335 


adult placenta 


Clontech


APL001


102 217 238 537 641 700


placenta 


Invitrogen 


APL002 


663 851 1048 


adult spleen


GIBCO


ASP001


8 45 74 111 132 140 151 185 217 238
294 414 446 477 504 514 534 545 549 
592 722 873 883 952 976 1041-1042 
1083 1093-1094 1152 1224 


testis 


GIBCO


ATS001


72 107 111 113 126 140 151 183 215
238 446 497 537 642 701-706 811 877
927 962 976 1083 1117 1131


adult bladder 


Invitrogen 


BLD001


41 151 191 402-405 409 414 496 545 
592 607 706 873 952 1178 1329-1335 


bone marrow 


Clontech


BMD001


8 58-62 65-68 74 79 108 111 116 137
147 151 164-174 213-215 238 305-307
374 404 446 460 466 516 519 534 538-
541 544-546 549-554 566 584 586 592
596 607 610 628-629 643-645 652 707-
708 774-789 844 866-871 873 919 927 
952 963 976 998 1034 1042 1064 1083 
1085 1120 1132 1152 1225 1229 1268 
1307 1310 


bone marrow


Clontech


BMD002


6 8 37-38 52 74 77 105 111 129 132
210 317 510-511 545 549 581 598 628
638 724 766 789 844 860 868 873 919
927 952 963 968 976 1042 1111 1141
1160-1161 1229 1266 1346


bone marrow


Clontech 


BMD004 


111 238 282 549 1083


adult colon 


Invitrogen


CLN001


52 260 264 299 494 536 545 564 592 
844 873 877 952 976 1042 1152 1268 
1336-1337 


adult cervix 


BioChain 


CVX001


49 51 129 132 151 205 207 238 332-
335 365-367 392-401 440 466 470-471
518 537 597 629 832 877 927 976 1006 
1085 1117 1129-1134 1192 1202-1205 
1219 1309-1328 


diaphragm 


BioChain 


DIA002


74 976 1083 





endothelial cells


Stratagene


EDT001


32 40-41 49 74 79 101 111 120 132
138 151 204-206 215-217 238 269 316
414 433 505 510 513 550 555 580 582 
596 675 722 745 798 814 836-841 851 
918 976 1041 1043 1073 1083 1131 
1331 


Genomic clones
from the short arm 
of chromosome 8 


Genomic DNA 
from Genetic 
Research 


EPM001


525-532 927


Genomic clones 
from the short arm
of chromosome 8 


Genomic DNA 
from Genetic
Research 


EPM003 


47 525


Genomic clones
from the short arm
of chromosome 8


Genomic DNA
from Genetic 
Research 


EPM004 


525 927 


Genomic clones 
from the short arm 
of chromosome 8


Genomic DNA
from Genetic
Research


EPM005


531 


esophagus 


BioChain


ESO002 


74 138 238 


fetal brain


Clontech


FBR001


441-442 927


fetal brain 


Clontech 


FBR004 


215 893 927 1001 


fetal brain


Clontech


FBR006 


4S61 101 170 Wl X'K'k MO 147 90fi 

VA IvA Vi*\M UXt UO 1*TV 1^/ JCA/O 

225 271 317 319 336 359 368 405-414 
519 550 571 594 686 715 722 764 824 
829 836 859 909 927 94'^ 947 961 10^7 

VJir^ UmJV ^Jtl yr/ 7\IJ A\U t 

1067-1068 1104 1135-1140 1162 1206- 

* # A WW A A A A*i'^ A A~V A A AX* 

1207 1235 1268 1288 1307-1308 1319 
1338-1350 


fetal brain


Clontech


FBRs03 


111 446 


fetal brain 


Invitrogen


FBT002 


41 51 120 151 192-194 264 504 512 
535 683 761 798 820-827 844 876 909 
963 976 1026 1048 1083 1144 1302


fetal heart 


Invitrogen 


FHR001


446 566 761 


fetal kidney 


Clontech


FKD001


51 74 111 127 140 151 184 294 537 
550 630-631 1319 


fetal kidney 


Clontech 


FKD002


111 976 1083


fetal kidney


Invitrogen 


FKD007 


238 974 


fetal lung 


Clontech 


FLG001


463 566 976 1074 1083 1093 


fetal lung


Invitrogen


FLG003


41 238 330 407 415-416 537 573 844
859 1048 1083 1116 1192


fetal liver-spleen 


Columbia
University


FLS001


8 14 34-35 37 41 43 49 51 54-56 63-64
69-71 74 77 79 87-90 101 107 110-111
114 120 128-131 138 140 147 150-155
197 210 215 217 225 238 312 367 384
414 440 446 460 468 483 496 504-507
511-515 518-519 523 533-535 537 541
544-545 547-550 555-560 564 566 571
577 582 585-586 598 636 646-647 649
652 664 698 709-710 714 722-723 731
735-736 746-753 761 784 798 823 829
832 844 851 858-859 868 873 876 898
927 943 949 952 963 976 984 1002
1021 1023 1040 1042 1044 1050 1083
1093 1116 1120 1129 1131 1144 1174
1217 1251 1254 1256 1302 1308 1311 
1319 


fetal liver-spleen 


Columbia 
University 


FLS002 


8 36-37 41-46 49 54 64 71 74 79 101
111 120 129 147 207 210 215-216 238
250 330 353 359 366 383-384 414 478
505 508-509 511 515-524 534-535 537
544-545 564 566 571 577 591 598 638








663 671 698 714 722 725 727 751 798 
851 859 873 876 909 927 949 952 983- 
984 1002 1023 1042-1044 1085 1095
1131 1144 1178 1199 1233 1240-1270 
1331 1340 


fetal liver-spleen


Columbia 
University 


FLS003 


64 535 976 1256 


fetal liver


Invitrogen


FLV001


8 101 120 138 217 446 468 535 566 
580 722 730 749 844 918 943 976 1051 
1256 1331 


fetal liver


Clontech 


FLVCKW 


537 926 1256


fetal muscle


Invitrogen


FMS001


51 111 264 312 369-370 404 417-421
425 535 537 577 598 614 836 857 1141
1208 1268 


fetal muscle 


Invitrogen 


FMS002 


537 


fetal skin 


Invitrogen 


FSK001


13-26 32 41 51 89 107 111 147 151
225 264 316 405 422-429 488-494 496 
519 534-535 537 566 675 732 859 876- 
877 898 947 949-950 963 976 1001 
1062 1076 1083 1117 1144 1165 1268 
1281 


fetal skin 


Invitrogen 


FSK002 


537 812


fetal spleen 


BioChain 


FSP001


87 549 


umbilical cord


BioChain


FUC001


27-33 41 49 151 215 238 248-249 301
316 446 495-503 519 521 534-535 537 
582 634 691 877 883 927 944-950 963 
976 1001 1075 1142-1143 1171 1218 
1243 1308 


fetal brain 


GIBCO 


HFB001


41 49 57 79 87 103 111 120 132-135
138 145 151 188 197 207 215 238 264
271 294 316 367 414 440 446 466 504
513-514 535 542-543 550 564 571 596
635 648-654 675 711-715 722-723 798
832 872 876 883 927 976 1095 1144
1168 1171 1178 1211 1335


macrophage 


Invitrogen 


HMP001


238 


infant brain


Columbia
University


IB2002


49-50 77 81 89 105 111 136-138 140
151 161 175-179 185 216-217 264 295
299 308-310 371-373 462 476 504 511-
513 533 537 564 566 571 655-657 662 
683 716-720 723 752 790-803 829 832 
858-859 876 898 909 949 976 1045- 
1047 1076-1087 1090 1093 1116 1122 
1144 1209-1213 1225 1233 1256 1319 
1341 


infant brain


Columbia
University


IB2003


41 50 77 104 132 215 238 508 512-513 
519 566 655 714 794 918 943 976 1067 
1092-1093 1233 


infant brain


Columbia 
University 


IBM002 


311 472-473 753 1214


infant brain


Columbia
University


IBS001


51 111 376 474 790 876 949 1144 1204
1221 


lung, fibroblast


Stratagene


LFB001


151 316 462 514 534 582 675 939 1131 


lung tumor


Invitrogen 


LGT002 


1-7 41 74 79 94 115 120 138-139 156 
215 217 269 280 296 337 374-375 384 
404 446 454 475-480 498 514 518-519 
522 537 545 564 577 597 653 658 705 
721-724 754-756 779 859 868 872-874 
876-877 919 927 949 951-952 959 976
1002 1042 1048-1053 1076 1083 108&- 
1089 1131 1144-1147 1216-1218 1229








1293 1311 


lymphocytes 


ATCC 


LPC001


41 74 111 132 151 253 316 446 550
634 844 927 976 1085 1268


leukocyte 


GIBCO 


LUC001


8 11 41 74 86 91-98 101 109 111 120
147 151 212 215 218 238 252 288 312-
314 316 338 359 408 427 443-447 505
510 512 514 518 534 545 549-550 561 
564 566 571 577 580 582 587-609 615 
632-638 658-659 698 714 725-728 832 
836 841 859 866 873-874 882-883 918- 
919 927 943 952 963 976 1042 1076 
1083 1090 1148 1152 1168 1195 1219- 
1220 1224 


leukocyte 


Clontech


LUC003 


74 100 215 232 238 339-341 446 545 
657 660 729 873 883 927 952 963 1008 
1042 1116 1120 1149-1150 1215 1222 


Melanoma from cell 

line ATCC #CRL

1424 


Clontech 


MEL004 


210 215 238 342 534 545 592 722 873 
919 929 939 952 976 1071 1118 1218 
1235 1245 


mammary gland 


Invitrogen 


MMG001


8-10 40-41 49 73 80 114 138-140 147
217 250-256 264 297-299 305 377-378
398 446 481-486 505 512 537 545 549
571 592 725 730-733 816 829 836 844
868 873 876-877 898 926 943 951-960
963 976 995 1034 1042 1048 1054-
1055 1076 1083 1091 1093 1116-1117
1124 1152 1302


induced neuron cells 


Stratagene


NTD001


39 101 111 138 238 361 1225 1251
1319 


retinoic acid induced
neuronal cells


Stratagene


NTR001


74 225 976


neuronal cells 


Stratagene


NTU001


129 225 238 304 313 361 657 976


pituitary gland


Clontech


PIT004


976 


placenta 


Clontech


PLA003 


38 976 


prostate 


Clontech


PRT001


111 188 238 257-258 564 724 961-966
1067 1095 


rectum 


Invitrogen 


REC001


238 430-431 841 859 868 963 1001 
1116 


salivary gland


Clontech


SAL001


8 151 402 432-433 446 496 868 952
976 1083 1120 1151 1184


small intestine 


Clontech


SIN001


8 101 147 215 259-266 446 462 505 
545 592 660 789 836 866 873 927 952 
963 967-978 1042 1120 1152 1223- 
1224 


skeletal muscle 


Clontech


SKM001


238 302 927 943 992 1031 


spinal cord 


Clontech


SPC001


74 111 132 151 215-216 238 264 267-
270 343-344 353 379 516 537 566 740
828 927 976 979-994 1092 1153-1159
1225 1250 


adult spleen 


Clontech 


SPLc01


698 859 1042 


stomach 


Clontech 


STO001


210 238 271-272 537 580 705 918 952 
995 1171 


thalamus 


Clontech 


THA002 


61 219-220 273-276 312 315 330 596
963 996-1007 1059 1093 1160-1162 


thymus


Clontech


THM001


8 120 151 208 221 316-317 353 639
750 867 874 878-881 927 963 1023 
1083 1094-1096 1124 


thymus


Clontech 


THMc02 


8 61 114 129 132 210 225 231 306
317-319 336 340 359 380 398 446 448- 
463 512 519 545 554 587 598 698 724- 
725 789 812 836 868 873 927 947 952








976 1007 1042 1083 1085 1097-1116
1122 1147 1177 1226-1229 1234 1311
1313 


thyroid gland


Clontech


THR001


14 41 49 76 94 111 144 151 183 188
210 217 222 253 264 271 277-286 294
320-326 345-352 361 381-382 446 467
483 514 534 549-550 564 578 602 649
844 882-883 927 950 956 976 1008-
1028 1076 1083 1117-1120 1142 1163-
1175 1230-1238 1308 


trachea 




TRC001


223-225 238 287 353-354 514
545 592 611 873 883-884 927
952 1029-1031 1042 1151-1152
1170 1176-1177 1239


uterus 


Clontech 


UTR001


151 226 288-290 355 537 877 
885-886 976 1001 1032-1033 
1232 



TABLE 2 



SEQ ID NO:


Accession No.


Species


Description


Smith-Waterman Score


% Identity




1 


B02829 


Homo sapiens 


Human G protein coupled receptor hRUFS


460 


100 








protein SEQ ID NO:10.






2


G03564


Homo sapiens


Human secreted protein, SEQ ID NO: 7645.


111 


51 


3 


R26173 


Homo sapiens 


Part of Major Yo paraneoplastic antigen


293 


76 








(CDR62) encoded by clone pYZ 






4 


L29536 


Homo sapiens 


calcium channel L-type alpha 1 subunit


191 


65 


c 

J 




Homo sapiens 


Human secreted protein clone ytl4_l protein 












sequence SEQ ID NO:92. 






6 


M11507


Homo sapiens 


transferrin receptor 


120 


95 


7 


AF099100 


Homo sapiens


WD-repeat protein 6 


1941 


93 


8 


Y92338 


Homo sapiens


Human cancer associated antigen precursor from


245 


82 








clone NY-RBN-45. 






9 


G01343 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5424.


226 


91 


10 


AJ133798 


Homo sapiens 


copine VII protein 


1127 


68 


11 


G02449


Homo sapiens


Human secreted protein, SEQ ID NO: 6530.


584 


99 


12 


X98330 


Homo sapiens 


ryanodine receptor 2 


282 


78 


13 


AL024498 


Homo sapiens


dJ417M14.2 (novel serine/threonine-protein


293


100








kinase (ortholog of mouse and rat MAK (male












germ cell-associated kinase)) 






14 


AF045577


Pan


olfactory receptor OR93Ch


191 


36 






troglodytes 








15 


G03131 


Homo sapiens


Human secreted protein, SEQ ID NO: 7212.


93 


39 


16 


U26595 


Rattus


prostaglandin F2a receptor regulatory protein 


569 


89 






norvegicus 


precursor 






17 


B08915


Homo sapiens 


Human secreted protein sequence encoded by 


99 


44 








gene 28 SEQ ID NO:75.






18 


Y36203 


Homo sapiens


Human secreted protein #73.


163 


75 


19 


U15647 


Mus 


reverse transcriptase


106 


40 






musculus 








20 


G02701 


Homo sapiens 


Human secreted protein. SEQ ID NO: 6782. 


544 


100 


21 


Y33923 


Homo sapiens


Extended human secreted protein sequence, SEQ 


1691 


100 








ID NO. 172. 






22 


G04030 


Homo sapiens


Human secreted protein, SEQ ID NO: 8111. 


380 


96 


23 


G02455


Homo sapiens


Human secreted protein. SEQ ID NO: 6536. 


123 


30 


24 


AF036329 


Homo sapiens 


gonadotropin-releasing hormone precursor,


284 


90 








second form






25 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8148. 


96 


32 


26 


S80119 


Rattus sp.


reverse transcriptase homolog


100 


34 


27 


U83303 


Homo sapiens


line-1 reverse transcriptase


101 


33 


28 


G03267 


Homo sapiens


Human secreted protein, SEQ ID NO: 7348. 


135 


45




29 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8148.


83 


42 


30 


G02872 


Homo sapiens


Human secreted protein, SEQ ID NO: 6933.


116 


72 


31 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7452. 


96 


67 


32 


G03224 




Human secreted protein, SEQ ID NO: 7305.


58 


32 


33 


Y66688 




Membrane-bound protein PRO1152


2457 


98 




J O/v/I 




T4itfnnn oM^fwffMl nmtwn cAfmmCft SKC) ILl 
jjiiiiinii ifcvmcu ujvidu odjumwc kjjD^ ju^ 


348 


95 


















nuinu supiciia 


pi£u 


182 


48 


jO 


I /^404 


Homo Salens 


riuman secinea pruieni cione yi<i__i piuicm 




90 








piiniinnno Clvf^ TT\ \iiC\*\ 

ScqUraiCe dl^^ 11/ fNl^.l^U. 






Sf 


AT I^^OK 


_J 

Homo s^ims 


DAiUoLf/.D ^semapnoim 4vv (^sona uomam, 


Oof 


go 








irriTniirmalnhnltn ilnmain flnri IrtinomilTini Qili* 
lUUllUOU^UDUUD UUiUUUl UOUaUlGUiUllUK 


















38 


AC067969 


flininn flcids 


Homo s^icDS lyanodino icc^tor 1 (slwletsl} 


386 


66 














39 


AL031S88 


Homo sapiens 


dJl 163J1.1 (mostly supported by GENSCAN, 


493 


76 


















nuiuo sapicus 


I4innan cprrRfrvl nmtpin SHI ) 1 1 1 'NO* 'I'JOQ 


110 


51 


A1 
ni 




Homo s&picQS 


LAJIO^ |MVlCUl 


228 


68 


AO 
HZ 


Z^OxOo 


Homo sapiens 


xluiuaii scwcicu proicio cuujum uy gene h-j. 


220 


88 


jfZ 
4j 


Ad1U4o 


nyara 


mini-collsgen 




33 


44 


M7oMo 


JneliantDus 


hyditHcypioline-rich pn>tem 


1 in 


^1 






annuus 








43 


f TOOODO 

Uli2zoo 


(^aenomatKiiti 
s el^&ns 


Kac"iiKe ulrase 




/w 


*K) 




Homo sapiens 


niunan sccicieo praicui, oex^ ilj y%\j, § jjo. 


118 


58 


4/ 


ArU!AJ!r4Z 


M^Hno sapiens 




113 


63 


40 


OUjXi4 


Homo sapiens 


riuman sccrcicQ proicui, oj_a^ jjj invj. /os-j. 


on 


59 


>to 


AJUU330U 


MUS 


orKZo proTBm 




JO 






musculus 








<A 
jU 


uUz43u 


Homo ss^iens 


riuman seCToteo proiem, oims^ uj nxj, j i . 


JO J 






iyio4y 


Homo sapiens 


miman scctclcu proLcui scquoicc ciiia/ucu uy 




94 


















fiomo sapiens 


ni 1 f u 1 ii/A ni 
pUiJulrC pj Jw 


103 


38 






Homo sapiCTS 


Unman GTf VO mmvtmn 

xiuman oiija^ pnnem* 




85 






Homo sapiens 


Human secretea proieiQ, oca^ lu inlf. oooo. 


Ml 


JO 




ADnnoiTC 


Mus 


hepatic nuclear fector l^ieta ^ort form 


J JO 


7<1 






musculus 


rn— r - . '■ 1 






3d 


M05741 


HcHno sapiens 


pTotem*'tyiosine pnopbatasc 


IAS 

lOJ 




D/ 


Af A01£AA 


Homo sa|)iens 


Cj!AIco.i ^cniOTioe oianiiei i) 




7fi 
/o 


JO 


Ai7ni 1AIT 
ArUi I'll / 


XAi«n 

MUS 


pultttivc phcFomonc receptor 




JJ 






musculus 








jy 






ZmC UDgCX piUlCIIl AtFSrK I J 


558 


68 














60 


U73036 




intcrfrmn reoiiltniv •fai'lur 7 


263 


% 


^1 

UJ 




hdus 


PlUIClU^jrilKuUC n IIBIffT 


297 


69 






miKCutus 








62 


Y2986r 


Hnmn onnicrm 


Hiimnn <:ecfsted mtitein clime clil% 4- 

X lull'"'* Q^VAVlJvU |f J Wl 1 VMJlAW ■^/•^w • 


791 


98 


63 


U35376 


IXUUIU 9ciyXWl0 


renrftssnr trflncfyintinnal fartnr 


485 


65 


64 


AF265555 


Hnmn cnntfinc 


ubionitin^nnhiealfQff HlK-domain enzvme 


785 


74 








APOLXjON 






65 


G03883 


Homo sapiens


Human secreted protein, SEQ ID NO: 7964. 


88 


95 


66 


AF 177390 


MflnducA 


antconal ^>ccific membrane protein AMP 


274 


54 






SGXtA 








67 


AB040800 


Homo sapiens


SREB2 


614 


100 


6t" 






24 


213 


26 






hapesvinis4 








69 


G02963 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7046. 


261 


95 


70 


W75770 


Homo sapiens 


Human oxidoreductase YTFQS.


1144 


98 


71 


AB011135


Homo sapiens 


KIAA0563 protein


239 


76 


72 


AB014885 


Halocynthia


HiTOPK-1 


813 


78 






roretzi








73 


AF045454 


Cavia


phospholipase B


955 


73 






porcellus








74 


J02870 


Mus 


laminin receptor


308 


^1 



ICS






musculus








75 


Y00826 


Rattus 
norvegicus


gp210 (AA 1-1886) 


413 


84 


76


AIT J tnnKA 


Homo sapiens


thyroid hormone receptor-associated protein 
complex component TRAP240 


351 


54 


77 


Y38422 


Homo sapiens


Human secreted protein.


468 


76 


78




Homo sapiens


Human T-type voltage-gated Ca channel alpha- 
l-I(hCavT3). 


1357 


99 


79




Human

papillomavirus

type 68


APM-1 protein 


767 


100 




ATI OTOfVl 

AL1J78U2 


Homo sapiens 


dJ798A10^ (KIAA044S protem) 


71 


34 


81


ArUUU3a3 


Arabidopsis 
thaliana


protein arginine N-methyltransferase-like protein


359 


65 


82 


L46815 


Mus 

musculus 


DNA binding protein Re 


895 


75 


83 


G01600 


Homo sapiens


Human secreted protein, SEQ ID NO: 5681. 


315 


96 


84 


Y53886 


Homo sapiens


A suppressor of cytokine signalling protein
designated HSCOP-6.


338 


71 


85 


AB029002


Homo sapiens


KIAA1079 protein


134 


42 


86


Y28678


Homo sapiens 


Human cw272_7 secreted protein. 


325 


62 


87 


Y99368 


Homo sapiens


Human PRO1326 (UNQ686) amino acid
sequence SEQ ID NO: 100. 


156 


48 


88 


AJ225124


Mus

musculus


hyperpolarization-activated cation channel,
HAC3 


487 


95 


89


AF177203


Homo sapiens 


cerebral cell adhesion molecule 


290 


56 


90 


Y28280 


Homo sapiens 


Human G-protein coupled receptor GRIR-2.


326 


79 


91 


L39891 


Homo sapiens 


polycystic kidney disease-associated protein 


1751 


95 


IVi 

fi 


AF064876 


Homo sapiens 


ion channel BCNG-1 


953 


99 


93 


AF170723 


Homo sapiens 


protein kinase STK10


401 


53 


94 


X13292 


Trypanosoma
brucei 


GPI-phospholipase C (AA 1 - 358)


151 


37 


95 


Y34127 


Homo sapiens 


Human potassium channel K+Hnovl 1. 


661 


99 


96 


X03638 


Rattus 
norvegicus 


sodium channel protein I (aa 1-2009)


1775 


92 


97 


AF134213 


Homo sapiens 


ubiquitin-specific protease


1995 


99 


98 


G00838


Homo sapiens


Human secreted protein, SEQ ID NO: 4919.


213 


38 


99 


AF021935 


Rattus 
norvegicus 


mytonic dystrophy kinase-related Cdc42-binding
kinase


675 


48 


100 


AF279265 


Homo sapiens 


putative anion transporter 1 


867 


98 


101 


AC007878 


Homo sapiens 


match to nuclear protein, NP220; note: sequence 
difference at residue 58 


160 


60 


102 


U22829 


Mus 

musculus 


P2Y purinoceptor


264 


42 


103 


Y45023 


Homo sapiens


Human sensory transduction G-protein coupled
receptor-B3.


516 


99 






— — ; 

Homo sapiens 


Human secreted protein vb21_1, SEQ ID NO:20.


787 


98 


105 


Y87342 


Homo sapiens 


Human signal peptide containing protein HSPP-


343 


57 




AT 1 Z 


— : 

Homo sapiens


hepatic angiopoietin-related protein


212 


67 


107 


AF116657 


Homo sapiens


PRO1310


74 


52 


ins 


ACUUU4U1 


Escherichia
coli 


sialic acid transporter


587 


96 


109 


Y38395 


Homo sapiens


tiuinaij secfcicu proiein euCOQeu oy gene rio. J v. 


093 


1 AA 
lUU 


1 in 


X /ooUJ 


Homo sapiens 


Hydrophobic domain containing protein clone 
HP00631 amino acid sequence.


182 


94 


111 




Homo sapiens 


nuclear pore complex protein hnup153


464 


85 


112 


Y94939 


Homo sapiens 


Human secreted protein clone ye90 I protein 
sequence SEQ ID NO:84. 


274 


51 


113 


AF016365 


Homo sapiens 


hexokinase 1 isoform td


301 


71 


114 


AC007956 


Homo sapiens


unknown 


520 


75 


lis 


M83738 


Homo sapiens 


protein-tyrosine phosphatase 


251 


92 


116 


AL157952 


Homo sapiens 


dJ875K15.1.1 (ets homologous factor (ets-
domain transcription factor ESE-3A, isoform 1))


484 


91 


117 


W18084 


Homo sapiens


Human Aurora-2.


546 


87 






WO 01/57188
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SEQ 

ID 

NO: 


No. 






Waterman
Score 


Identity 


118 


1/41816 


Homo sapiens


cfim lrinn.cp T 

Weill 1 ft II If iT>i X 






119 


AJ006710 


Rattus
norvegicus


nhosnTintufvliivmitnl IHnnc^ 






120 


AF0269S4 




pyruvate dehydrogenase phosphatase regulatory

subunit precursor; PDPr


10*K> 


OA. 


121 


S3 93 92 


TfoTinn cflnieim 


3.13.48} 




fist 

DO 


122 


1160805 




oncostatin-M specific receptor beta subunit




olo 


123 


Y44403 


Homo sapiens 


Human truncated tankyrase-1.


111 


35 


124 




VrjuniUiIiaOUlU 


contains similarity to C2 domains


2iy 




125 






' i — — ^7=" J. — : — r — r — : 

guanine nucleotide binding protein beta subunit

A 


693 


91) 


126 


AB021861 


Mus

musculus


apoptosis signal-regulating kinase 2 


153 


65 


127 




Homo sapiens


concentrative Na+-nucleoside cotransporter

DV/IN 1 J 


807 


97 


128 


M90360 


Homo sapiens 


protein kinase 


220 


73 


129 


r)^99ft9 




alpha 1C adrenergic receptor isoform 2


CIA 

574 


so 


130 




Homo sapiens


iriiDD 


496 


67 


1^1 


AF9niTl/l 


Mus

musculus 


testis specific serine kinase-3


800 


87 




AT 1 i^ooO 


Bos taurus


differentiation enhancing factor 1


159 


74 






Homo sapiens


phospholipase C-beta-1b


554 


85 


134 


W74802 


Homo sapiens


Human secreted protein encoded by gene 73 
Clone n^i/jbL.2j. 


1157 


87 






~TT = 

Homo sapiens


Pancreas-specific gene 


668 


96 






Homo sapiens


A secreted protein encoded by clone dt674 2.


866 


98 


137 


AC002563 


Homo sapiens 


putative RHO/RAC effector protein; 95%
similarity to P49205 (PID:g1345860)


5041 


99 


J JO 




-== s 

Homo sapiens


rKL/J4J4, a novel secreted protein.


891 


100 


139 


AB024034 


Arabidopsis 

thaliana


DNA-damage inducible protein DDI1-like


147 


55 


140 


W97809 


Homo sapiens 


Human GTPase regulator GRAF. 


248 


56 




1^X33/ 


Homo sapiens 


Human fl.Az protein.


125 


46 






Rattus
norvegicus


AMPA receptor binding protein 


623 


93 


143 


W26642 


Homo sapiens


Human RECK cancer-inhibiting protein. 


641 


82 






Rattus 
norvegicus 


transmembrane receptor UNC5H2


578 


84 






Homo sapiens


scavenger receptor cysteine-rich type 1 protein
M160 precursor 


727 


92 






Homo sapiens 


Human secreted protein 3. 


140 


40 


147 


M0A9M 


Homo sapiens


galactose-1-phosphate uridyl transferase


3li 


81 


148 


D64014 


Escherichia
coli


HrsA 


818 


90 


149 




coli 


F u , '■ — 

pppGpp phosphohydrolase


Q1 C 


!o 


150 


AL163279 


Homo sapiens


hofnnlncr tn pAMP rp-^nnnQ^ ^iRirwit hmHmcr nml 

UvAIIVlwg wTUTU 1 VOUVUdC dOlimiL UlUUUlg Til Pi' 

beta tifm^Hiicin fflmilv nrmtmn^ 

t/vi4A u Ml iijLiuwm 1 cu ill 1 J 1 Pi iriviiiiT 






151 


AF179867 


Homo sapiens 


STE20-like kinase


940 


99 


152 


R95332 


Homo sapiens


lirand (clone 3TW^ 






153 


AF151859 


Homo sapiens 


CGI-101 protein 


370 


92 


154 


X66957 






toy 


of 


155 


Y16355 


Homo sapiens


alternatively spliced form


432 


92 


156 


G0OSS7 




Unman »ini'm li* il «imfMn CCf^ TT^ XJO* ylO^ff 
niUImD Scucucu pTUlcm, OCA^ ILJ iHU. 




TO 


157 


AF159455 


Mus 

musculus 


zinc finger protein 


352 


74 


158 


L76191 


Homo sapiens


interleukin-1 receptor-associated kinase


537 


76 


159 


AP001743 


Homo sapiens


putative gene, ankirin like, possible dual 
specificity Ser/Thr/Tyr kinase domain


670 


98 


160 


AJ250425 


Rattus 
norvegicus 


Collybistin I


556 


74 


161 


G02885


Homo sapiens


Human secreted protein, SEQ ID NO: 6966. 


370 


100 






PCT/US01/03800



SEQ 

ID 

NO: 


Accession
No.


Species


Description


Smith-

Waterman 

Score 


% 

Identity 


162 


Z22968 


Homo sapiens


M130 antigen 


610 


100 


163 


AF181121


Homo sapiens 


ATP-dependent Ca2+ pump PMR1


336 


92 


164 


AF055636 


Homo sapiens 


leucine-rich glioma-inactivated protein precursor


455 


94 


165 


AF160798 


Rattus
norvegicus


calcium transporter CaT1


700 


96 


166 


Y76332 


Homo sapiens


Fragment of human secreted protein encoded by
gene 38. 


327 


45 


167 


Y48607 


Homo sapiens 


Human breast tumour-associated protein 68. 


1072 


99 


168 


AB020741 


Mus 

musculus 


NIK-related kinase 


197 


43 


169 


AF252293 


Homo sapiens


PAR3 


596 


44 


170 


U59429 


Cricetinae 
gen. sp. 


diacylglycerol kinase eta


481 


82 


171 


AF035268 


Homo sapiens 


phosphatidylserine-specific phospholipase A1


386 


42 


172 


AF12708S 


Mus 

musculus 


semaphorin cytoplasmic domain-associated
protein 3B 


507 


82 


173 


Y27918 


Homo sapiens


Human secreted protein encoded by gene No.
123. 


653 


99 


174 


G02979 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7060. 


538 


97 


175 


U36488 


Mus 

musculus 


embryonic stem cell phosphatase 


168 


55


176 


W95629 


Homo sapiens 


Homo sapiens secreted protein gene clone
gml96_4. 


1022 


100 


177 


AF289023 


Homo sapiens 


formiminotransferase cyclodeaminase form D


255 


93 


178 


X04936 


Homo sapiens 


T-cell receptor alpha-chain (413 is 2nd base in
codon) 


710 


99 


179 


AF127481 


Homo sapiens


non-oncogenic Rho GTPase-specific GTP
exchange factor


175 


80 


180 


G00978 


Homo sapiens


Human secreted protein, SEQ ID NO: 5059. 


517 


94 


181 


Y66645 


Homo sapiens


Membrane-bound protein PRO1310.


671 


96 


182 


AF110640


Homo sapiens 


orphan seven-transmembrane receptor


862 


100 


183 


AB020854 


Bos taurus


orphan transporter short splicing variant 


766 


84 


184 


AF169691 


Homo sapiens 


cadherin-like protein VR8


375 


38 


185 


AF126372 


Homo sapiens 


thyrotropin-releasing hormone degrading 
ectoenzyme 


985 


99 


186 


L20966 


Homo sapiens


phosphodiesterase 


541 


76 


187


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7001.


254 


93 


188 


Y94918 


Homo sapiens 


Human secreted protein clone ddS04_18 protein 
sequence SEQ ID NO:42.


301 


98 


189 


Y66713 


Homo sapiens


Membrane-bound protein PRO1309.


694 


100 


190 


G03244 


Homo sapiens


Human secreted protein, SEQ ID NO: 7325. 


331 


73 


191 


U36771 


Rattus
norvegicus


sn-glycerol 3-phosphate acyltransferase


707 


92 


192 


R05935 


Homo sapiens 


Secreted GPIIb subunit of multiple subunit 
polypeptide (MSP)GPIIb-IIIa. 


157 


72 


193 


M92084 


Theileria 
parva 


casein kinase II alpha subunit


364 


50 


194 


Y66645 


Homo sapiens 


Membrane-bound protein PRO1310.


448 


90 


195 


W95631 


Homo sapiens 


Homo sapiens secreted protein gene clone
hj968_Z 


382 


49 


196 


AF255614 


Rattus 
norvegicus 


scaffolding protein SLIPR


680 


99 


197 


AC021640 


Arabidopsis
thaliana


putative phosphatidate phosphohydrolase


300 


41 


198 


AF073967 


Mus 

musculus 
domesticus 


olfactory receptor


316 


43 


199 


W01730 


Homo sapiens 


Human G-protein receptor HPRAJ70. 


617 


98 


200 


AF117948 


Homo sapiens 


pancreas-enriched phospholipase C 


625 


89 


201 


AF128e25 


Homo sapiens 


CDC42-binding protein kinase beta 


636 


94 


202 


AFll'^ 


Homo sapiens


Link guanine nucleotide exchange factor II


1303 


100 


203 


Y53021 


Homo sapiens 


Human secreted protein clone qc646_l protein 
sequence SEQ ID NO:48. 


701 


99 


204 


AF227968 


Homo sapiens 


SH2-B beta signaling protein


182 


79 


205 


S81752 


Homo sapiens 


DPH2L=candidate tumor suppressor gene


375 


100 






SEQ 

ID 

NO: 
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No. 


Species 
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Smith- 
Waterman 
Score 


% 

Identity 








{ovarian cancer critical region of deletion}






206 


U18315 


Sus scrofa


parathyroid receptor


122 


60 


207 


AF255342 


Homo sapiens 


putative pheromone receptor V1RL1 long form


170 


96 


208 


S52051 


Rattus sp.


neurotransmitter transporter


715 


94 


209 


W63683 


Homo sapiens 


Human secreted protein 3.


840 


99 


210 


D79992 


Homo sapiens


similar to Drosophila photoreceptor cell-specific
protein, calphotin.


541 


82 


211 


AF117948


Homo sapiens 


pancreas-enriched phospholipase C


1348 


99 


212 


U81035 


Rattus 
norvegicus 


ankyrin binding cell adhesion molecule
neurofascin


471 


69 


213 


AF154846


Homo sapiens 


zinc finger protein 


798 


5^ 


214 


AF102777


Mus

musculus 


FYVE finger-containing phosphoinositide kinase


933 


93 


215 


AL163303


Homo sapiens 


putative gene containing transmembrane domain 


523 


89 


216 


U26595 


Rattus 
norvegicus 


nrfl^nfflflnfllTl rm^frAtw npfliilAtnrv nmtMn 

precursor 


563 


78 


217 


GO409S 


14nrno <3inipiic 


fliiTTinn secreted nrrttpm TD'NO* SIT/^ 




yo 


218 


X75756 


Homo sapiens 


protein kinase C mu 


314 


81 


219 


Y66723


Homo sapiens


mCUlLfliUKT^DUUnU piVlCUl CmS\JX I w> 


77ft 




220 


D88577 


Mus 

musculus 




/ 


Afi 


221 




nuinu Sapiens 








222 


AF021935 


Rattus 
norvegicus 


mytonic dystrophy kinase-related Cdc42-binding
kinase


636 


96 


•yn 




nuiuu supiviio 


u/ixi juu.i \i\ KJDasc \ri\jsj\.) sncnor pioiein 
H) 






224 




Homo sapiens


WNT receptor frizzled-4


oyu 


DO 


225 


AF030430 


Mus 

musculus


semaphorin VIa


703 


68 






Escherichia


putative dihydroxyacetone kinase (EC 2.7.1^)






227 




Homo sapiens


piiu&puuiuusiioi j*pniis>piiaic~uii]uing proiciir'^ 


^vov 




228 


AB024573 


Mus 

musculus


GTP-binding like protein 2 


265 


88 


229 


AF 122924 


laevis


Wnt inhibitory factor-1




An 

Hi/ 


230 


G03205


Homo sapiens


Human secreted protein, SEQ ID NO: 7286.


229 


100 


231 


X98260 


Homo sapiens


M-phase phosphoprotein 11


26S 


92 


232 


R92754 


Homo sapiens


Human growth differentiation factor-17


682 


95 


233 


R75111 


Homo sapiens 


Glycosyl-phosphatidylinositol-specific
phospholipase-D.


290 


100 


234 


W69431 




Hiifnfln wmnfvy) nrr^fyin rwl^')^ ^ 
iUiltUUX DCUdCU |/1ULC1U vWlXJJ J> 






235 


Y08686 


Homo sapiens 


serine palmitoyltransferase, subunit II


859 


81 


236 


AF11827S 




aimpomtcinirai piuicui 




^7 


237 


X81466 


Mus 


Embryo Brain Kinase


460 


62 


238 


U64857 


Caenorhabditis


similar to the BPTI/Kunitz family of inhibitors;

most similar to tissue factor pathway inhibitor

precursor (TFPI)


284 


33 


239 


AJ250840 


Mus 

musculus


serine/threonine protein kinase


739 


63 


240 


AJ223472 


Mus 

musculus 


transcription elongation factor TFIISii 


222 


38 


241 


Y94906 


Homo sapiens 


Human secreted protein clone ii>649 3 protein 
sequence SEQ ID NO:18.


353 


52 


242 


AF169301 


Homo sapiens 


Na+/sulfate cotransporter SUT-1


591 


99 


243 


L22022 


Rattus 
norvegicus 


orphan transporter v7-3 


667 


93 


244 


AF016191 


Rattus
norvegicus 


potassium channel 


1043 


98 


245 


AF097366 


Homo sapiens 


cone sodium-calcium potassium exchanger 


645 


98 


246 


Y29868 


Homo sapiens


Human secreted protein clone pp325 9. 


497 


98 


247 


AF180475 


Homo sapiens


Not4-Np 


188 


83 


248 


Y17227 


Homo sapiens 


Human secreted protein (clone yal-1).


690 


99 


249 


AF2S0910 


Manduca 


death-associated small cytoplasmic leucine-rich


182 


31 






SEQ 
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Species 


Description 


Smith- 


% 


ID 


No. 






Waterman 


Identity


NO: 








Score 








sexta 


protein SCXF 






250 


AF192756 


Kaposi's 


Orf73 


134 


34 






sarcoma- 












associated 












herpesvirus








251 


AB022694 


Homo sapiens 


MOK protein kinase 


209 


83 


252 


W55045 


Homo sapiens 


Neural adhesion molecule (edib0018£2 product). 


469 


100 


253 


L46815 


Mus 


DNA binding protein Rc 


251 


67 






musculus 








254 


W68S0S 


Homo sapiens 


Human acid sensing ionic channel.


173 


82 


255 


AF070066 


Mus 


Citron-K kinase 


1201 


98 






musculus 








256 


G02491


Homo sapiens 


Human secreted protein, SEQ ID NO: 6572. 


460 


100 


257 


Z12841


Oryctolagus


Phospholipase


368 


80 






cuniculus








258 


Y95436 


Homo sapiens 


Human calcium channel SOC-3/CRAC-2. 


1857 


99 


259 


AI222968 


Mus 


L-periaxin 


430 


72 






musculus 








260 


AJ250839 


Homo sapiens


serine/threonine protein kinase


861 


100 


261 


AJ249977 


Homo sapiens 


AMP-activated protein kinase gamma 3 subunit 


758 


98 


262 


AF141386 


Rattus


SUT-2 


198 


40 






norvegicus 








263 


AF022859 


Homo sapiens 


neuropilin-2(a0)


335 


62 


264 


AF160477 


Homo sapiens 


Ig superfamily receptor LNIR precursor


387 


91 


265 


Y44662 


Homo sapiens 


Human 14273 G-protein coupled receptor


636 


99 








(GPCR). 






266 


U27269 


Mus 


sodium glucose cotransporter


204 


56 






musculus 








267 


AF124491 


Homo sapiens 


ARF GTPase-activating protein GIT2


159 


75 


268 


AF127389 


Rattus


putative taste receptor TR1


209 


39 






norvegicus 








269 


X98296 


Homo sapiens


ubiquitin hydrolase


215 


95 


270 


X78482 


Streptococcus


Fc-gamma receptor 


129 


26 






pyogenes 








271 


AB009883 


Nicotiana 


KED 


109 


26 






tabacum 








272 


AF137367 


Mus 


VPS10 domain receptor protein SORCS


899 


97 






musculus 








273 


L34938 


Rattus 


ionotropic glutamate receptor 


460 


86 






norvegicus 








274 


AL022724 


Homo sapiens 


dJ413H6.1.1 (Hamster Androgen-dependent


188 


74 








Expressed Protein LIKE PUTATIVE protein) 












(isoform 1) 






275 


AF265555 


Homo sapiens


ubiquitin-conjugating BIR-domain enzyme


173 


94 








APOLLON 






276 


G02872 


Homo sapiens


Human secreted protein, SEQ ID NO: 6953. 


148 


56 


277 


L40380 


Homo sapiens 


thyroid receptor interactor 


430 


61 


278 


AB046851


Homo sapiens 


KIAA1631 protein 


283 


96 


279 


AC008075 


Arabidopsis


Contains PFj00069 Eukaryotic protein kinase 


157 


43 






thaliana


domain.






280 


M83738 


Homo sapiens


protein-tyrosine phosphatase 


181


73 


281 


AK024397 


Homo sapiens 


unnamed protein product 


439 


91 


282 


AF141326 


Homo sapiens


RNA helicase HDB/DICE1


497 


84 


283 


AF156530 


Mus 


ETS-domain transcriptional repressor PE1


60S 


76 






musculus 








284 


Y29336 


Homo sapiens 


Human secreted protein clone cs756_2 alternate 


647 


100 








reading frame protein.






285 


Y73402 


Homo sapiens


Human secreted protein clone yc25_l protein


300 


90 








sequence SEQ ID NO:26. 






286 


AF016411 


Homo sapiens 


KCNA3.1B 


137 


100 


287 


W89253 


Homo sapiens 


Human ALP. 


688 


97 


288 


AF112886


Bos taurus


differentiation enhancing factor 1


750 


96 


289 


AF113131


Homo sapiens


host cell factor homolog LCP


367 


44 


290 


U52111 


Homo sapiens 


plexin-related protein 


698 


100 


291 


AF026S04 


Rattus 


SPA-1 like protein p1294


603 


89 






ID 

NO: 


No. 


Species 


Description


Smith- 
Waterman 
Score 


% 

Identity














292 




norvegicus 


r rr— i a r: 

membrane-associated guanylate kinase




53 


293 


X99211 


Drosophila 


ubiquitin-specific protease


143 


38 


294 


Y94943 


Homo sapiens 


Human secreted protein clone yt14_1 protein


18S 


94 


295 


Y94890 




l-liTYnnn nmtmi nlrtrM* V4MM'709 

nuiiiaii prvicul wiuuc £ua/^/7o< 






296 


AF019767




£M\\t linger piuidii 




9o 


297 


Y28568 




ocwcLcu pcpuuc uunc IHU / / 1* 




Oil 

84 


298 


Y94943 


Homo sapiens


Human secreted protein clone yt14_1 protein


1 DO 

loZ 


97 


299 


B089O6 


Homo sapiens


Human secreted protein sequence encoded by 
gene 16 SEQ ID NO:63.


605 


69 


300 




Homo sapiens


Human-32 cadherin-related molecule.


oil 

212 


97 






Homo sapiens


neuropilin-2(a0)


277 


100 


302 


Y71124 


Homo sapiens 


Human mitogenic regulator duax2. 


716 


97 


mi 




Homo sapiens 


Human receptor tyrosine kinase.


228 


97 


304 


D32050 


Homo sapiens


alanyl-tRNA synthetase 


192 


80 


305 


U43586 


Homo sapiens 


protein kinase related to Raf protein kinases; 
Method: conceptual translation supplied by 
author


428 


72 


306 


R54872 


Homo sapiens 


Human H13 viral receptor mutant 4.


280 


95 


307 


JL//»3'AZ 


Mus 

musculus 


membrane glycoprotein 


199 


41 


308 


AF255614


Rattus
norvegicus 


scaffolding protein SLIPR


639 


88 


309 


S79463 


Mus sp.


semaphorin homolog=M-Sema F


162 


89 


310




Homo sapiens 


ATP-binding cassette sub-family A member 2


736 


100 


311 


U03413 


Dictyostelium 
discoideum 


calcium binding protein 


151 


36 


312 


Y87347 


Homo sapiens 


Human signal peptide containing protein HSPP-
124 SEQ ID NO:124. 


744 


100 


313 


Z97055 


Homo sapiens 


dr388M5.4 (putative GS2 like protein) 


789


99 


314 


ACU04U1U 


Homo sapiens 


similar to Leucine-rich transmembrane proteins;
44% similarity to U42767 (PID:g1736918)


197 


38 


315 


AL021392 


Homo sapiens 


dJ439F8.2 (supported by GENSCAN and 
GENEWISE)


278 


38 


J iu 




Mus

musculus 


polycystic kidney disease 1 protein


165 


38 


317 


AF109643 


Rattus 
norvegicus 


coxsackie-adenovirus-receptor homolog


223 


38 


318 


AF104923 


Homo sapiens 


putative transcription factor


138 


84 




API nfV3R7 


Trypanosoma


activated protein kinase C receptor homolog


141 


38 


320 


G00588 


Homo sapiens


Human secreted protein, SEQ ID NO: 4669. 


125 


51 




I AU71 


Homo sapiens 


Human secreted protein (clone CC332-33).


459 


97 


■yn 




Homo sapiens 


Human type 1 inositol 1,4,5-trisphosphate


232 


97 


323 


Y27918 




nuiuon ocMoim piuiciii cncoucu ny gcuc jnol 
123. 




SB 


324 


AF010144 


Homo sapiens 


neuronal thread protein AD7chN'I'F 


209 


70 


325 


M19650 

ATX J ^W^W 


Vlnfnn cnniMiQ 


2',3'-cyclic-nucleotide 3'-phosphodiesterase (EC

3.1.4.37)




Vl 


326 


W80396 


Homo sapiens 


A secreted protein encoded by clone bp646_l 0. 


140 


70 


327 


X75756 




protein kinase C mu


>4U 


78 


328 


G02292 




(JTlimnn cjw^v^mI n*v\t*m ^X1C\ 111 'KirV t^TX 

miiiirn ocdcioo pioiciii, o£A^ ixf OJ id. 


71 1 


nfi 
VJ 


329 


AF168990 


Homo sapiens


putative GTP-binding protein


877 


99 


330 


S67984 


Homo sapiens 


anti-HIV gp120 antibody heavy chain variable
region 


581 


80 


331 


X13916 


Homo sapiens 


LDL-receptor related precursor (AA -19 to 4525) 


2823 


98 


332 


Y87330 


Homo sapiens 


Human signal peptide containing protein HSPP-
107 SEQ ID NO:107.


1127 


100 


333 


Y28503 


Homo sapiens 


HGFH3 Human Growth Factor Homologue 3.


320 


98 


334 


AC002563


Homo sapiens


putative RHO/RAC effector protein; 95% 


327 


93 






SEQ 

ID 

NO: 


Accession 
No. 


Species 


Description


Smith-

Waterman

Score 


% 

Identity 








similarity to P49205 (PID:g1345860)






335 


Y87347 


Homo sapiens


Human signal peptide containing protein HSPP- 
124 SEQ ID NO:124.


nil 


67 


336 


AF006466 


Mus 

musculus


lymphocyte specific formin related protein


193 


75 


337 


AF265555 


Homo sapiens


ubiquitin-conjugating BIR-domain enzyme 
APOLLON 


632 


97 


338 


Y13443 


Homo sapiens 


Amino acid sequence of bSlo3-2. 


516 


100 


339 


Y07637 


Homo sapiens


putative GABA-gated chloride channel 


189 


100 


340 


Y05734


Homo sapiens


Human Grb7 effector 2.2412 protein. 


2156 


99 


341 


AE000497 


Escherichia 
coli 


L-idonate transcriptional regulator


928 


98 


342 


D90855 


Escherichia 
coli 


glycerol-3-phosphate dehydrogenase (EC 
1.1.99.5) chain A, anaerobic


769 


99 


343 


D8S613 


Escherichia 
coli 


membrane component 


399 


100 


344 


M93239 


Escherichia 
coli 


transmembrane protein 


232 


100 


345 


M60177 


Escherichia 
coli 


enterobactin 


759 


99 


346 


D90699 


Escherichia 
coli 


Sensor nratein conS CFXZ 1 7^ -"^ 


638 


VI 
y 1 


347 


D90843 


Escherichia
coli 


CqjB protein. 


552 


100 


348 


M13422 


Escherichia 
coli 


49 kd protein 


1193 


96 


349 


L10328 


Escherichia 
coli 


similar to drug resistance translocases


340 


90 


350 


X69942 


Mus 

musculus 


enhancer-trap-locus-1


560 


82 


351 


AF239613 


Homo sapiens


activated potassium channel




ou 


352 


D90777 


Escherichia 
coli 


3-hydroxybutyryl-CoA dehydrogenase (EC

1.1.1.157) (b-hydroxybutyryl-CoA
dehydrogenase) (BhbD).


577 


inn 


353 


D90863 


Escherichia 
coli 


similar to 


311 


98 


354 


Y52386 


Homo sapiens 


Human transmembrane protein HP02000. 


133 


58 


355 


Y31645 


Homo sapiens


Human transport-associated protein-7 (TRANP-
7). 


482 


55 


356 


Y58637 


Homo sapiens 


Protein regulating gene expression PRGE-30.


119 


51 


357 


AF119226 


Homo sapiens


dual-specificity tyrosine phosphatase YVH1


1788 


100 


358 


Y87219 


Homo sapiens 


Human secreted protein sequence SEQ ID 
NO:238.


165 


100 


359 


J00132 


Homo sapiens 


beta-fibrinogen


233 


93 


360 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


128 


70 


361 


R28916 


Homo sapiens 


Type III procollagen (prior art).


108 


40 


362 


UI66S5 


Rattus 
norvegicus


phospholipase C delta-4


649 


65 


363 


G03119 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7200. 


95 


42 


364 


W7276 


Gallus gallus


chicken brain factor-2


104 


34 


365


G03789 


Homo sapiens


Human secreted protein, SEQ ID NO: 7870. 


183 


65 


366 


G04091 


Homo sapiens


Human secreted protein, SEQ ID NO: 8172. 


118 


46 


367 


X982S8 


Homo sapiens


M-phase phosphoprotein 9


564 


75 


368 


AL021366 


Homo sapiens


CICK0721 Q.3 (Kinesin related protein)


3387 


99 


369 


U70932 


Peromyscus 
leucopus 


reverse transcriptase 


92 


59 


370 


X86400 


Homo sapiens 


gamma subunit of sodium potassium ATPase 
like 


242 


73 


371 


G03172 


Homo sapiens


Human secreted protein, SEQ ID NO: 7253. 


165 


36 


372 


U49974 


Homo sapiens


mariner transposase 


257 


55 


373 


X13916 


Homo sapiens 


LDL-receptor related precursor (AA -19 to 4525)


21193 


99 


374 


AF234765 


Rattus
norvegicus 


serine-arginine-rich splicing regulatory protein
SRRP86 


1182 


78 


375 


U49974 


Homo sapiens 


mariner transposase 


172 


55 






SEQ 


Accession 


Species 


Description 


Smith- 


% 


ID 

NO: 


No. 






Waterman

Score 




376 


G01984 


Homo sapiens


Human secreted protein, SEQ ID NO: 6065. 


221 


67 


377 


G00669 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4750. 


600 


100 


378 


X52574 


Mus 

musculus 


O if binding protein 


1456 


91 


379 


R69095 


Homo sapiens


Anti-HIV Fab tatSl light chain. 


68 


37 


380 


J04974 


Homo sapiens 


alpha-2 type XI collagen 


125 


37 


381 


AB00240S 


Homo sapiens


LAK-4p 


530 


43 


382 


U64830 


Dictyostelium 
discoideum 


protein tyrosine kinase


115 


44 


383 


G02916 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6997.


618 


98 


384 


G01194


Homo sapiens 


Human secreted protein, SEQ ID NO: 3275. 


617 


93 


385


AI245822 


Homo sapiens 


type I transmembrane receptor 


4560 


100 


386 


D86974 


Homo sapiens 


KIAA0220 


2148 


98 


387 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7284. 


142 


50 


388 


G04072 


Homo sapiens


Human secreted protein, SEQ ID NO: 8153.


99 


59 


389 


M12140 


Homo sapiens


envelope protein


107 




390 


AJ293309 


Homo sapiens 


NHP2 protein 






391 


Y42751 


Homo sapiens


Human calcium binding protein 2 (CaBP-2). 


181 


94 


392 


W48351 


Homo sapiens


Human breast cancer related protein BCRB2.




DO 


393 


Y14442 


Homo sapiens


olfactory receptor protein




3*1 


394 


W85607 


Homo sapiens


Secreted protein clone da228 6.


957 


100 


395 


Y76332 




Fragment of human secreted protein encoded by
gene 38.


171 


34 


396 


G03930 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8011. 


250 


100 


397 


AB032904 


Hrvl/tna two 

rLyujua vss 


dopamine receptor D4


105 


35 


398 


AJ007798 




suuliuii cUUigCIl yoXtWJj) 


ool 


83


399 


Y91405 


Homo sapiens


Human secreted protein sequence encoded by 


1047 


92 


400 


Y29861 


Homo sapiens 


Human secreted protein clone cb98 4. 


162 


37 


401 


D87002 


Homo sapiens 


similar to rat integral membrane glycoprotein;

*u«c<s£>iuu nuinocr £>zi3i 3. 


527 


78 


402 


AF J 00754 




ouwcui uuii{uiijuus piuiciu iwjirx isoionn 




ys 


403 


X74S)04 


111 1 1 1 J r " 






£.f\ 


404 


AF075462 


Mus

musculus 


ADP-ribosylation factor-directed GTPase

activating protein isoform b


D4D 


oy 


405


X92887 


Human 

endogenous 

retrovirus K


pol/env 


ioz 


JU 


406 


Y30162 


Homo sapiens


Human dorsal root receptor 4 hDRR4. 


323 


72 


407 


AK022626 


Homo sapiens


unnamed protein product




QO 


408 


L138Q2 


Homo sapiens


ribosomal protein small subunit


264 


92 


409 


Y91600 


Homo sapiens


Human secreted protein sequence encoded by
gene 9 SEQ ID NO:273.


17oo 


QO 


410 


W88745 


Homo sapiens 


k^fvicmi piuicui cuuuucu oy Kcnc 3U ctonc 
HTSEV09 






411 


AB043953 


musculus 


Chat-H 




RO 
OA. 


412 


Y86233 


Homo sapiens 


Human se^'eted protein 11^X1^1X29, SEQ ID 
NO:14& 


1014 


92 


413 


U10542 


Pan 

troglodytes 


MHC class I A


265 


71 


414 


AF155097 


Homo sapiens 


NY-REN-7 antigen 


830 


93 


415 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7284. 


88 


48 


416 


Y57911 


Homo sapiens 


Human transmembrane protein HTMPN-35.


266 


89 


417 


W27651 


Homo sapiens


Secreted protein AT205. 


481 


60 


418 


Y76884 


Homo sapiens


Retinoblastoma binding protein-7 sequence.


3077 


87 


419 


AF255559 


Notothenia 
coriiceps 


alpha tubulin 


289 


68 


420 


G01984


Homo sapiens 


Human secreted protein, SEQ ID NO: 6065.


209 


74 


421 


AL109827 


Homo sapiens 


dJ309K20.2 (acrosomal protein ACR55 (similar 
to rat sperm antigen 4 (SPAG4)))


1446 


96 


422 


AC008075 


Arabidopsis 
thaliana


F24J5.4 


112 


35 



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SEQ 

ID 

NO: 


Accession 
No. 


Species 


Description


Smith-

Waterman

Score


% Identity


423 


AF231705 


Homo sapiens


Alu co-repressor 1


1090 


100 


424 


AF234887 


Homo sapiens 


FLAMINGO 1 


6268 


97 


425 


Y35942 


Homo sapiens


Extended human secreted protein sequence, SEQ
ID NO. 191.


1961 


99 


426 


AB009288 


Homo sapiens 


N-copine 


63S 


98 


427 


L12392 


Homo sapiens 


Huntington's Disease protein


16080 


99 


428 


Y94990 


Homo sapiens


Human ornrftted niYiiein vK7 1 1 5!F.O TO Kr>*7n 


too 


on 


429 


AJ293573 


Homo sapiens


zinc finger protein Cezanne




R7 


430 


Y84441 


Homo s^iens 


Amino acid sequence of a human RNA- 
associated protein.


2074 


inn 

lUU 


431 


G02850


Homo sapiens 


Human secreted protein, SEQ ID NO: 6931. 


723 


95 


432 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8148.


73 


47 


433 


AF1S9296 


Lycopersicon
esculentum 


extensin-like protein 


613 


HO 


434 


W48351 


Homo sapiens 


Human breast cancer related protein BCRB2.


135 


AA 
ft 


435 


X73874 


Homo sapiens 


phosphorylase kinase


3442 


y / 


436 


AF161426 


Homo sapiens


HSPC308 


268 


■74 


437 


Y30812 


Homo sapiens


HiiTnsn QRCmttt^ nrvitpin pnrrvlMl fiftiii pmip 
*-*mn*m Dvvs^tiKU |/ivbdii wubwucu uuui ^dic 


loss 


3^ 


438 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7879.


loo 




439 


X14766 


Homo sapiens 


GABA-A receptor alpha 1 subunit


2294 




440 


X02344 


Homo sapiens 


beta-tubulin




yj 


441 


AF16S418 


Homo sapiens


WrU T till 11^ alfy wil Wiuicgicuur I 




100


442 


L11672


Homo sapiens 


zinc finger protein 


795 


54 


443 


G03203


Homo sapiens


Human secreted protein, SEQ ID NO: 7284.




20 


444 


A52140 


UlUUCU lUICU 






100 


445 


¥093-10 


Homo sapiens


ryanodine receptor 2 




99 


446 


AF116712


Homo sapiens


ns\j£. loo 


ZZ7 


49 


447 


AF245447 




spujiigosMuc Ainasc lypc ^ isoiorui 


j/O 


99 


448 






membrane-type serine protease 1




y4 


449 


U87305 


Rattus
norvegicus


transmembrane receptor UNC5H1


817 


93 


450 


AF081249 


Homo sapiens 


JAW1-related protein MRVI1A long isoform


4568 


99 


451 


AC00549K 


Homo sapiens




ilo 


62 


452 


M60235 


Homo sapiens


gnmuic umDuranc pi uicur* i fu 




VJ 


453 


AB036706 


Homo sapiens


intelectin


730 


88 


454 


VJwU7l0 


Homo sapiens


Human secreiea proiem, oJbv^ ill r*\j. 4yw. 


263 


81 


455 


Y22634 


Homo sapiens


Human cytokine inducible regulatory protein-1
(CIRP-1).




67 


456 


Y36705 


Homo sapiens


Fragment of human secreted protein encoded by
gene 62.






457 


N9132S 




DNA encoding human growth hormone receptor.


3282 


96 


458 


M19ISS 


Plasmodium


S-antigen precursor 


no 


36 


459 


Y13377 


Homo sapiens 


Amino acid sequence of protein PRO257.


509 


98 


460 


Y02693 




Human secreted protein encoded by gene 44 




43 


461 


Y14482 


Homo sapiens 


Fragment of human secreted protein encoded by
gene 17.


184 


54 


462 


Y53005 


Homo sapiens


Hlimnn cp-frpfrprf nmi'Rin r^lnnA nmTAO fi nmtwm 

niiiimii an*rcicu |MUicjm viuuc pni /*ry__o pjuicm 

sequence SEQ ID NO:16.


133 


4/ 


463 


X84960 


Triticum
aestivum


low molecular weight glutenin


1 AO 

Ivy 


33 


464 


W19919 


Homo sapiens




1781 


85


465 


AF189764 


Mus

musculus


alpha/beta hydrolase-1


502 


59 


466 


U93569 


Homo sapiens


p40 




3U 


467 


Y41528 


Homo sapiens


Fragment of human secreted protein encoded by
gene 77. 


1172 


99 


468 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


149 


52 


469 


AJ000008 


Homo sapiens 


PI3-kinase 


5832 


97 


470 


X70922 


Mus 

musculus


neurotoxin homologue 


118 


47 


471 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7878. 


198 


75 


472 


Y36705 


Homo sapiens


Fragment of human secreted protein encoded by
gene 62.


72


57


473 


G02313 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6394. 


328 


100 


474 


Y07007 


Homo sapiens 


Breast cancer associated antigen precursor 
sequence. 


1013 


97 


475


W93234 


Homo sapiens 


Human ESRP1 protein.


943 


80 


476 


W48351 


Homo sapiens 


Human breast cancer related protein BCRB2.


236 


65 


477 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


202 


60 


478 


G01870 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5951.


267 


100 


479 


AF102777 


Mus 

musculus


FYVE finger-containing phosphoinositide kinase 


3427 


92 


480 


G03052


Homo sapiens


Human secreted protein, SEQ ID NO: 7133.


123 


53 


481 


W87701 


Homo sapiens 


A human membrane fusion protein designated 
SYTAX1.


221 


77 


482 


G03119 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7200. 


131 


39 


483 


AF210651 


Homo sapiens


NAG18 


124 


39 


484 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


343 


50 


485 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4718.


129 


70 


486 


U15174 


Homo sapiens


BCL2/adenovirus E1B 19kD-interacting protein
3 


149 


73 


487 


Y76167 


Homo sapiens 


Human secreted protein encoded by gene 44,


627 


100 


488 


AJ275213 


Homo sapiens 


stabilin-1 


1244 


91 


489 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7879.






490 


L12392 


Homo sapiens 


Huntington's Disease protein


16081 


100


491 


G03789


Homo sapiens


Human secreted protein, SEQ ID NO: 7870.


107 


oo 


492 


J03799 


Homo sapiens 


laminin-binding protein 


228 


70 


493 


U15174 




3 






494 


Y02693 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22 


197 


67 


495 


AC005175 


Homo sapiens 


K31449 3 


oof 


yr 


496 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7867.


229 


Ol 


497 


AB030237 


Canis 
familiaris


D4 dopamine receptor 




■♦o 


498 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953.


228 


63 


499 


1X70935 


Peromyscus 
maniculatus


reverse transcriptase 


213 


52 


500


imso8 


Homo sapiens 


skeletal muscle ryanodine receptor 


26406 


99 


501 


G03371


Homo sapiens 


Human secreted protein, SEQ ID NO: 7452. 


105 


38 


502 


AF119851


Homo sapiens


PRO1722


136 


62 


503 


AF113685


Homo sapiens 


PRO0974 


116 


30 


504 


U79458 


Homo sapiens


WW domain binding protein-2


322 


59 


505 


W2%51 


Homo sapiens 


Human secreted protein CJD124 3. 


608 


J J 


506 


W85459 


Homo sapiens


Secreted protein encoded by clone dhl 135 9. 


986 


70 


507 


Y86265 


Homo sapiens 


Human secreted protein HUSXE77, SEQ ID 
NO:180. 


1 15 




508 


AL160175 


Homo sapiens 


bA243J16J (similar to MYLK (myosin, light 
polypeptide kinase)) 


184 


92 


509 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


97 


62 


510 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


117 


63 


511 


W79092 


Homo sapiens


Human secreted protein do740_3. 


1038 


100 


512


AF010144 


Homo sapiens


neuronal thread protein AD7c-NTP


205 


64 


513 


AJ133439 


Homo sapiens 


GRIPl protein 


2151 


100 


514 


AE003456 


Drosophila
melanogaster


CG6393 gene product 


259 


42 


515


zmo6 


Xenopus 
laevis 


p46XlEg22 


128 


40 


516 


AF104413


Homo sapiens 


large tumor suppressor 1 


1766 


94 


517 


G03797 


Homo sapiens


Human secreted protein, SEQ ID NO: 7878.


92 


40 


518


AF151083


Homo sapiens 


HSPC249 


444 


98 


519 


S80864 


Homo sapiens


cytochrome c-like polypeptide


318 


50 


520 


X92485 


Plasmodium 
vivax 


pval 


170 


61 




521 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7871.


159 


so 


522 


AF121857 


Homo sapiens


sorting nexin 7




tv 


523 


G026S4 


Homo sapiens 


Human <;ecreta1 nmtetn SPO TntJO* fiT\^ 


ox 


J / 


524 


W88627 


Homo sapiens


Secreted protein encoded by gene 94 clone
HPMBQ32.




fj 


525 


AF119851


Homo sapiens


PRO1722


1& 




526 


Y27761 


Homo sapiens 


Hurnan secnneted nmtcin cncnHeri hv opnr'Mn AfJ 






527 


G02707 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6788. 


70 




528 


U47924 


Homo sapiens 


C8 


1 1 XX 


DO 


529 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8144.


Vr 




530 


G03203 


Homo sapiens


Human secreted protein, SEQ ID NO: 7284.


111


Ov 


531


G04067 


Homo sapiens


Human secreted protein, SEQ ID NO: 8148.


yx 


AC 
03 


532 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7348.




^O 


533 


G03203 


Homo sapiens


Human secreted protein, SEQ ID NO: 7284. 


182 


48 


534 


AF068286 




rxiyvrivix^ J or^ 


ODl 


100 


535 


U07707 


Homo sapiens 


epidermal growth factor receptor substrate


228 


60 


536 




Homo sapiens 


Human secreted procem, ocQ uJ NO: oOio, 


484 


75 


537 


nrJt I y^j£. 


Gallus gallus


qin-induced kinase


206 


53 


538 


AF135022 


Homo sapiens 


mediator 


128 


100 






Homo sapiens 


Human secreted protein, SEQ ID NO: 7348. 


141 


59 


540 


AF016430 


Caenorhabditis
elegans


contains similarity to a BR-C/TTK domain


853 


39 


541 


AC003093 


Homo sapiens 


OXYSTEROL-BINDING PROTEIN; 45% 
similarity to P22059 (PID:g129308)


408 


66 






Homo sapiens 


integrin alpha subunit precursor 


517 


81 


543 


AF102530 


Mus 

musculus 


olfactory receptor F3


327 


73 


544 


Y73431 


Homo sapiens 


Human secreted protein clone ybl86_l protein 
sequence SEQ ID NO:84. 


386 


100 






Pseudomonas 
aeruginosa 


probable TonB-dependent receptor 


279 


42 


546 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7874. 


264 


53 






Homo sapiens


A human monocyte-macrophage apolipoprotein
B receptor protein. 


1772 


67 


548




Homo sapiens


Human secreted protein sequence encoded by
gene 43 SEQ ID NO:166.


176 


100 






Homo sapiens


Human secreted protein, SEQ ID NO: 5652.


777 


99 


550 




Homo sapiens 


protein regulating cytokinesis 1; PRC1


1953 


88 


551 




Homo sapiens 


Human secreted protein clone pe584_2 protein
sequence


1224 


94 


552


X98330 


Homo sapiens 


ryanodine receptor 2


24621 


99 


553


Y42782 


Homo sapiens 


riuman D&no it a i protein. 


684 


95 


554 


AB0252S8 


musculus 


granuphilin-a 


50] 


41 


555




Homo sapiens




1468 


100 


556 


W92388 


Homo sapiens 


Human iK-mteracbng protein S239a. 


538 


92 


557 


AFI 108^1 


Homo sapiens




175 


59 


558 


API I77« 


Homo sapiens


thyroid hormone receptor-associated protein
complex component TRAP150


183 


32 


559 


G02872 


Homo sapiens


Human secreted protein, SEQ ID NO: 6953.


319 


6S 


560


086214


Mus

musculus


Ca2+ dependent activator protein for secretion


1010 


93 


561 


AF187325 


familiaris


ulciniiniuii anogcu 


2X1 


55 


562 


AJ001981 


Homo sapiens




2512 


99 


563 




Rattus

norvegicus


glutamate receptor subtype delta-1 


338 


66 


564 


W30638 


Homo sapiens


Partial human 7-transmembrane receptor 
HAFO 1 67 protein. 


371 


100 


565 


AC005620


Homo sapiens 


R33590 1 


467 


97 


566


Y99358 


Homo sapiens 


Human PRO1772 (UNQ834) amino acid
sequence SEQ ID NO:63. 


1138 


78 


567 


ALOSim 


Homo sapiens 


dJ889M153 (novel protein) 


1002 


58 


568 


AF151043 


Homo sapiens 


HSPC209 


798 


100 




569 


AF097518 


Homo sapiens 


liver-specific transporter 


231 


100 


570 


AB035698 


Homo sapiens 


Misshapen/NQC-related kinase MINK-1 


1532 


100 


571 


Y07096 


Homo sapiens


Colon cancer associated antigen precursor
sequence. 


1064 


100


572 


AL031177


Homo sapiens


dJ889M153 (novel protein)


735 


55 


573 


Y66639 


Homo sapiens 


Membrane-bound protein PRO290.


254 


45 


574 


AB037108 


Homo sapiens


seven transmembrane domain orphan receptor


loo J 




575 


D43949 


Homo sapiens


This gene is novel.




100


576 


Y48596 


Homo sapiens 


Human breast tumour-associated protein 57.


108 




577 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4433. 


141 


75 


578 


R95913 


Homo sapiens


trial HtryaH nmt^n 
l^dUm UUvctU UlUUTUL. 


\Ar\ 
IW 


03 


579 


AK025116 








7U 


580 


Y86473 


Homo sapiens 


Human gene S2-encoded protein fragment, SEQ


77 


70 


581 


rir ITU # ly 


Homo sapiens


JIvilO protein 


450 


100 


582 




Homo sapiens


prDtclH 


330 


98 


583 




familiaris


D4 dopamine receptor


64 


56 


584 






numao secrccea protein, ocv ^ tiK), o /uz. 


345 


90 


585 


AL096828 


Homo sapiens 


dJ963E22.1 (Novel protein similar to NY-REN-2
Antigen)


268 


85 


586 


Y30819 


Homo sapiens


Human secreted protein encoded from gene 9. 


235 


35 


587




Homo sapiens


Human secreted protein, SEQ ID NO: 4438.


132 


56 


588


G02872


Homo sapiens


Human secreted protein, SEQ ID NO: 6953.


182 


79 






musculus


2P1 protein 


764 


80 


590


W88627


Homo sapiens 


Secreted protein encoded by gene 94 clone 
HPMBQ32. 


329 


81 


591




Homo sapiens


Amino acid sequence of a human secreted
protein. 


110 


43 






Homo sapiens


A human seven transmembrane signal transducer 
polypeptide. 


1369 


92 


593 


Y33031 


Homo sapiens 


Human secreted protein clone dd119_4 protein
seqiKuce oci^ wj ri\j. lus. 


1112 


97 


594 




"u : 

Homo sapiens


Human secreted protein encoded by gene No. 92. 


763 


79 


595 


G03798 


Homo sapiens


Human secreted protein, SEQ ID NO: 7879.


136 


58 


596 


API511 in 
nr 1.111 iv 


iiiiifWjiinia 


MJxi proiGiu 


Z215 


95 


597 


G03786 




Human secreted protein, SEQ ID NO: 7867.




o5 


598


AF192499


Mus 


putative secreted protein ZSIG37 


143 


40 


599 


AF11985S 




PROl RAT 




7o 


600


G02872 




Human secreted protein, SEQ ID NO: 6953.




to 


601 


Y00295 


Homo sapiens


Htimnn ^P(*n*tftH nmtptn mc^vlnl hv am^^ ^9 

AlUIUCUl a^VlwLBU UiUkwlU wllbUUCU DY KGiUC JO* 




OQ 
OO 


602 


AF184971 


Homo sapiens


cln<:<: 1 f fvtnlrinp rpr-mt'nr 7f ^V'I'OR'7 




'JA 
/** 


603 


AF061936 


Homo sapiens 


diacylglycerol kinase iota 


773 


96 


604 


AL096828 


Homo sapiens


dJ963E22.1 (Novel protein similar to NY-REN-2
Antigen)


1 


!o 


605 


AB033106 


Homo sapiens


KIAA12X0 protein




100


606 


X75756 


Homo sapiens


protein kinase C mu




OO 


607 


D86983 


Homo sapiens 


sunilnr to 1 ) m p| tmno a qtpj nprnvirlnotn/lf 11 1 


^7^R 


OO 


608 


W69341 


Homo sapiens 


Secreted protein of clone 6(279 1. 


1377 


99 


609 


W88627 


Homo sapiens


Secreted protein encoded by gene 94 clone
HPMBQ32.




oz 


610 


Y27868 


Homo sapiens


Human secreted protein encoded by gene No.
107.


iio 


OZ 


611 


AF202636 


Homo sapiens 


angiopoietin-like protein PPl 138 


2164 


100 


612 


AF090944 


Homo sapiens 


PRO0663 


218 


82 


613 


Y02693


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HTDAD22. 


195 


59 


614 


M87053 


Rattus 
norvegicus 


lens membrane protein 


450 


84 


615 


AC004232 


Homo sapiens 


FPM315 


163 


37 


616 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6065. 


205 


79 




617


Y91524


Homo sapiens


Human secreted protein sequence encoded by

amp- lA ^VCt ID Kn>1Q7 
{{cue OEA^ iU iWJ.lVia 


5Z1 




618


AJ24S621 


Homo sapiens 


CTL2 protein 


2258 


99 


619


I /0170


Homo sapiens


Human secreted protein encoded by gene 75.


108 


64 


620 


AF067864 


Homo sapiens 


transferrin receptor 2 alpha


3922 


94 


621 


D90721 


Escherichia
coli


Transmembrane protein dppC 


573 


90 


622


W75o5o


Homo sapiens


Human secretory protein of clone CS752-3.


730 


100






Homo sapiens 


Human secreted protein vbl2_l, SEQ ID NO:4. 


733 


100 


624 


At 034745 


Mus 

musculus 


LhTXpSO 


637 


83 






Paramecium
bursaria
Chlorella
virus 1


FK^-ncn, IPITNMSLPLS (3x) 


94 


46 




u /yzou 


Homo sapiens 


unknown 


194 


70 


627




Homo sapiens 


Neural thread protein. 


99 


30 


628 


G03450 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7531.


427 


100 


629 


Y36281 


Homo sapiens 


Human secreted protein encoded by gene 58. 


590 


100


630 


Y02693 


Homo sapiens


Human secreted protein encoded by gene 44
clone HTDAD22.


165 


76 


631 


G02139 


Homo sapiens


Human secreted protein, SEQ ID NO: 6220. 


268 


96 


632 


U16996 


Homo sapiens


protein tyrosine phosphatase


351 


80 


633 


AF121857 


Homo sapiens 


sorting nexin 7 


2019 


100 


634 


AF283772 


Homo sapiens 


similar to Homo sapiens ribosomal protein L10
encoded by GenBank Accession Number 
L25899 


340 


77 


635 


Y07090 


Homo sapiens 


Renal cancer associated antigen precursor 
sequence. 


277 


64 


636 


AB013382 


Homo sapiens


DUSP6 


414 


76 


637 


G02872 


Homo sapiens


Human secreted protein, SEQ ID NO: 6953. 


315 


71 


638 


M95762 


Rattus 
norvegicus 


GABA transporter 


924 


89 


639 


G03789


Homo sapiens


Human secreted protein, SEQ ID NO: 7870.


219 


60 


640 


Y01400 


Homo sapiens


Secreted protein encoded by gene 18 clone 
HNHF029. 


137 


79 


641 


AC008075


Arabidopsis
thaliana


F24I5.4 


121 


33 


642


W74o24 


Homo sapiens 


Human secreted protein encoded by gene 96 
Clone haqxIajd i . 


615 


62 




A DAI com 


Homo sapiens


serine/ threonine kinase 


485 


98 


644




Homo sapiens


Human secreted protein fragment encoded from
gene 23.


162 


46 






Homo sapiens


membrane protein DAP10


474 


100 


646 


AF233323 


Homo sapiens 


Fas-associated phosphatase-1


200 


38 


647




Homo sapiens 


Homo sapiens clone BK158_1 protein. 


1203 


99 


648




Homo sapiens


^"WW>«Mr III, 1, — II t II T II 

cuxi w-iiKB protem 


1440 


98 


649 


Y36203 


Homo sapiens


Human secreted protein #75. 


233 


73 






Homo sapiens


Human secreted protein, SEQ ID NO: 6953. 


173 


78 


651


Y32199


Homo sapiens


Human receptor molecule (REC) encoded by
Incyte clone 20Z2379.


1012 


100 






Hylobates


dopamine receptor D4


122 


32 


653


AK021848


Homo sapiens


unnamed protein product


186 


69 






Homo sapiens 


Human secreted protein encoded by Gene No.
13.


57 


37 


655 


L224SS 


Rattus 
norvegicus


mu opioid receptor 


116 


34 


656 


G03112 


Homo sapiens


Human secreted protein, SEQ ID NO: 7193. 


110 


45 


657 


G02345 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6426. 


459 


97 


658 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 clone 
HPMBQ32.


291 


75 


659 


G02832 


Homo sapiens


Human secreted protein, SEQ ID NO: 6913.


134 


65 


660 


Y91423 


Homo sapiens


Human secreted protein sequence encoded by
gene 11 SEQ ID NO:144.


333 


96 




661 


G03789 


Homo sapiens


Human secreted protein, SEQ ID NO: 7870.


168 


68 


662 


Y53886 


Homo sapiens


A suppressor of cytokine signalling protein
designated HSCOP-6.


375 


43 


663 


W75771 


Homo sapiens


Human GTP binding protein APD08.


629 


100 


664 


AL096770 


Homo sapiens 


bA150A6.2 (novel 7 transmembrane receptor 
(rhodopsin family) (olfactory receptor like)
protein (hsO^l-ai)) 


480 


55 


665 


AB037734 


Homo sapiens 


KIAA1313 protein


978 


96 


666 


W82841 


Homo sapiens 


Human cerebral protein-1. 


192 


84 


667 


W82841 


Homo sapiens 


Human cerebral protein-1. 


182 


87 


668


AB030184 


Mus 

musculus 


contains transmembrane (TM) region and ATP
binding region 


757 


68 


669 


AB032919 


Hylobates 
muelleri 


dopamine receptor D4 


8S 


37 


670 


AF107295 


Rattus 
norvegicus 


outer membrane protein 


746 


81 


671 


Z33642 


Homo sapiens 


leukocyte surface protein


394 


93 


672 


W85608 


Homo sapiens 


Secreted protein clone du41D_S. 


261 


91 


673 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7284. 


106 


48 


674 


AL035587 


Homo sapiens 


dJ475N16.4 (KIAA0240)


2388 


99 


675 


Y59668 


Homo sapiens 


Secreted protein 108-005-5-0-Cl-FL. 


1134 


53 


676 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7878. 


174 


74 


677 


AF026954 


Bos taurus


pyruvate dehydrogenase phosphatase regulatory
subunit precursor; PDPr


1013 


95 


678 


L11625


Mus 

musculus 


receptor protein-tyrosine kinase 


545 


96 


679 


AL031427 


Homo sapiens 


dJ167A193 (novel protein)


745 


100 


680 


AJ133430 


Mus 

musculus 


olfactory receptor 


528 


77 


681 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6613. 


179 


70 


682 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


336 


76 


683 


Y94943 


Homo sapiens


Human secreted protein clone ytI4_l protein
sequence SEQ ID NO:52.


118 


100 


684 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


100 


37 


685 


G00885 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4966.


162 


60 


686 


AK001518 


Homo sapiens


unnamed protein product 


590 


100 


687 


G01982 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6063.


718 


100 


688 


Y92241 


Homo sapiens 


Human cancer associated antigen precursor 
(Ma-REN-4g^ 


2405 


99 


689 


AC024792 


Caenorhabditis
elegans


contains similarity to T[LP783 16


423 


36 


690 


Y27868 


Homo sapiens 


Human secreted protein encoded by gene No.
107. 


183 


81 


691 


Y56514 


Homo sapiens 


Human Jurkat cell clone P2-1S AIMIO longest
ORF protein sequence. 


180 


88 


692 


Y27795 


Homo sapiens 


Human secreted protein encoded by gene No. 79. 


1539 


99 


693 


Y36268 


Homo sapiens


Human secreted protein encoded by gene 45. 


428 


98 


694 


U1246S 


Homo sapiens


ribosomal protein L35


308 


89 


695 


Y45272 


Homo sapiens 


Human secreted protein encoded from gene 16.


1517 


99 


696 


AF191838 


Homo sapiens


TANK binding kinase TBK1


1242 


98 


697 


Y02693 


Homo sapiens


Human secreted protein encoded by gene 44
clone HTDAD22.


275 


75 


698 


Y87280 


Homo sapiens 


Human signal peptide containing protein HSPP-57 SEQ ID NO:57.


576 


90 


699 


Y97999 


Homo sapiens 


Human SCAD family molecule HSFM-1, SEQ
ID NO:1.


729 


99 


700 


AJ006701 


Homo sapiens


putative serine/threonine protein kinase


610 


79 


701 


AF209198 


Homo sapiens


zinc finger protein 277 


2357 


100 


702 


AJ298841 


Mus 

musculus 


torsinA protein


709 


45 


703 


AK021729 


Homo sapiens 


unnamed protein product


622 


98 


704 


Z46787 


Caenorhabditis
elegans


similar to Glutaredoxin, Zinc finger, C3HC4
type (RING finger) 


920 


51 


705 


G02882 


Homo sapiens


Human secreted protein, SEQ ID NO: 6963. 


589 


98 




706 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6582. 


125 


58 


707 


R95326 


Homo sapiens 


Tumor necrosis factor receptor 1 death domain
ligand (clone 2DD).


121 


95 


708 


G03002


Homo sapiens


Human secreted protein, SEQ ID NO: 7083.


125 


39 


709 


Y96202 


Homo sapiens 


IkappaB kinase (IKK) binding protein, Y2H56.


516 


98 


710 


M63577 


Saccharomyces
cerevisiae


SFPl 


131 


59 


711 


AB026291 


Rattus 
norvegicus 


acetoacetyl-CoA synthetase 


467 


85 


712 


D21211 


Homo sapiens 


protein tyrosine phosphatase (PTP-BAS, type 3)


368 


44 


713 


AF044033 


Marmota
marmota


olfactory receptor


615 


83 


714 


G03561 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7642. 


251 


100


715 


AB033062 


Homo sapiens


KIAA1236 protein


1380 


100 


716 


G00577


Homo sapiens


Human secreted protein, SEQ ID NO: 4658. 


80 


73 


717 


Y96864 


Homo sapiens 


SEQ. ID. 37 from WO0034474.


835 


99 


718 


AJ243396 


Homo sapiens


voltage-gated sodium channel beta-3 subunit 


234 


100 


719 


U47334 


Homo sapiens 


similar to chicken gamma aminobutyric acid 
receptor beta4 subunit 


578 


99 


720 


AB020S98 


Homo sapiens


peptide transporter 3


1096 


100 


721 


Y53886 


Homo sapiens


A suppressor of cytokine signalling protein
designated HSCOP-6. 


570 


74 


722 


J0S046 


Homo sapiens 


insulin receptor-related receptor


6787 


100 


723 


AF001958 


Ambystoma 
tigrinum 


electrogenic Na+ bicarbonate cotransporter;
NBC 


111 


41 


724 


AF127084 


Mus 

musculus


semaphorin cytoplasmic domain-associated 
protein 3A 


5253 


94 


725 


X54673 


Homo sapiens 


GABA transporter 


3114 


99 


726 


AF016191 


Rattus 
norvegicus 


potassium channel 


370 


100 


727 


AB029559 


Rattus 
norvegicus 


BATl 


139 


35 


728 


Y28503 


Homo sapiens 


HGFH3 Human Growth Factor Homologue 3. 


2186 


97 


729 


AJ01141S 


Homo sapiens 


plexin-B1/SEP receptor


729 


56 


730 


Z93096 


Homo sapiens


bK390B3.1 (manic fringe (Drosophila)
homolog) 


142 


68 


731 


Z10062 


Homo sapiens 


cDNA encoding a human vanilloid receptor
homologue Vanibepl.


675 


99 


732 


AF161382 


Homo sapiens 


HSPC264 


492 


94 


733 


AB029033 


Homo sapiens 


KIAA1110 protein


3826 


99 


734 


AE000493 


Escherichia 
coli


putative transport protein


592 


97 


735 


AL033379 


Homo sapiens 


dJ417022.2 (novel 7 transmembrane receptor
(rhodopsin family) protein similar to high-
affinity lysophosphatidic acid receptor homolog)


2173 


99 


736 


AF132599 


Homo sapiens 


RANTES factor of late activated T lymphocytes-
1 


245 


56 


737 


X55019 


Homo sapiens 


acetylcholine receptor delta subunit


883 


99 


738 


X91906 


Homo sapiens 


voltage-gated chloride ion channel


1978 


100 


739 


AB026116 


Homo sapiens 


organic anion transporter 4 


1444 


98 


740 


D00570 


Mus 

musculus


open reading frame (196 AA)


83 


24 


741 


W03626 


Homo sapiens 


Human thyrotropin GPR N-terminal sequence.


118 


40 


742 


U66059 


Homo sapiens 


V_segment translation product 


614 


100 


743 


AF11981S 


Homo sapiens 


G-protein-coupled receptor


2751 


99 


744 


X16663 


Homo sapiens 


haematopoietic lineage cell protein (AA 1-486)


148 


93 


745 


W67838 


Homo sapiens 


Human secreted protein encoded by gene 32 
clone HLTa63. 


448 


95 


746 


W57260 


Homo sapiens 


Human semaphorin Y. 


2414 


100 


747 


W21578 


Homo sapiens


Alzheimer's disease protein encoded by DNA
from plasmid pGC82232.


968 


65 


748 


Y94935 


Homo sapiens 


Human secreted protein clone yd218_l protein 
sequence SEQ ID NO:76. 


622 


100 


749 


AL022238 


Homo sapiens 


dJ1042K10.5 (novel protein)


314 


85 


750 


G03889


Homo sapiens 


Human secreted protein, SEQ ID NO: 7970. 


391 


87 




751 


ABa252S8 


Mus 

musculus 


granuphilin-a


773 


41 


752 


Y52386 


Homo sapiens


Human transmembrane protein HP02000.


900 


99 


753 


Y48586 


Homo sapiens


Human breast tumour-associated protein 47.


2527 


99 


754


AJ272207


Homo sapiens


putative G protein-coupled receptor 92


694 


100 


755 


M8SI83 


Rattus 
norvegicus


vasopressin receptor 


979 


68 


756 


AF190501 


Homo sapiens


leucine-rich repeat-containing G protein-coupled
receptor 6


388 


71 


757 


Y02692 


Homo sapiens


Human secreted protein encoded by gene 43
clone HTADXn.


461 


R7 


758 


Z22535 


Homo sapiens


ALK-3 


439 


70 


759 


R04932 


Homo sapiens


Interferon-gamma receptor segment from clone
39 responsible for binding the target


564 


07 
71 


760 


W74902 


Homo sapiens


Hiimwn secreted nmtein enonrlRd Kv oeno 1 ^^ 

clone HE8BI92. 


1217 


77 


761 


G03706 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7787. 


223 


88 


762 


AB020676 


Homo sapiens


KIAA0869 protein




OQ 


763 


AK026992


Homo sapiens


unnamed protein product 


2285 


99 


764 


AF173358








100


765 


AF268066 


musculus 


netrin 4






766 


Y48585 


Homo sapiens


Human breast tumour-associated protein 46.


1 1 £0 


oy 


767 


AF230378 


musculus 


UliraidUUIr'l uClIB 




45 


768 


AF121975 


musculus


odorant receptor S18


268 


62 


769 


AB008S15 


Homo sapiens 


RanBPM 


611 


57 


770 


Y09945 




putative integral membrane transport protein


456 


50 


771 


AF226731 




AD026 


Qoo 


OQ 


772 


Y27132 


Homo sapiens


OA004FG^ 




1 nil 


773 


X87832 


Homo sapiens


NOV/plexin-Al protein 


1821 




774 


AB0252S8 


Mus 

musculus 


granuphilin-a 


500 


41 


775 


AF125101


Homo sapiens


HSPC040 protein 


232 


93 


776 


G02815 


Homo sapiens


Human secreted protein, SEQ ID NO: 6896.






777 


G02493 


Homo sapiens


Human secreted protein, SEQ ID NO: 6574.


1Q1 


Oo 


778 


R03301 


Homo sapiens 


Sequence of pre-human atrial natriuretic peptide.


213 


45 


779 


AL357374 








100


780 


AF100346 


Homo sapiens 


neuronal voltage gated calcium channel gamma- 


1434 


89 


781 


Y19566 


Homo sapiens 


Amino acid sequence of a human secreted
protein.


- lUJ 




782 


Y36233 


Homo sapiens 


Human secreted protein encoded by gene 10.


1098 


93 


783 


AF084464 


Rattus
norvegicus


GTP-binding protein REM2


141 


30 


784 


W49042 


Homo sapiens


Human low density lipoprotein binding protein
LBP-3.


2693 


99


785 


AF238381 


Homo sapiens


PTDVl 


1904 


91 


786 


Y91870 


Homo sapiens


Human apoptosis related protein. 


547 


100 


787 


Y71062 


Homo sapiens 


Human membrane transport protein, MTKP-7. 


1062 


94 


788 


AF117754


Homo sapiens


thyroid hormone receptor-associated protein
complex component TRAP240


8684 


98 


789 


AL049569 


Homo sapiens


dJ37C10.3 (novel ATPase) 


2848 


96 


790 


AF151848 


Homo sapiens


CGI-90 protein


745 


96 


791 


Y08639 


Homo sapiens 


nuclear orphan receptor ROR-beta


1421 


95 


792 


Y41706


Homo sapiens


Human PRO381 protein sequence.


644 


99 


793 


AF121228 


Homo sapiens


thyroid hormone receptor-associated protein
complex component TRAP95 


1037 


100 


794 


G04072 


Homo sapiens


Human secreted protein, SEQ ID NO: 8153.


124 


62 


795 


Y69384 


Homo sapiens 


Amino acid sequence of a 14274 receptor 
protein. 


119 


100 


796 


W40215 


Homo sapiens 


Human macrophage antigen.


1358 


99 






WO 01/57188



PCT/US01/03800



SEQ

ID

NO:


Accession 
No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identity 


797 


AF258340 


Homo sapiens 


hepatocellular carcinoma-associated antigen 112


1151 


99 


798 


AF159615 


Homo sapiens


FGF receptor activating protein 1 


461 


98 


799 


Y59863 


Homo sapiens


Human normal uterus tissue derived protein 26.


797 


99 


800 


W70459 


Homo sapiens 


Human T1-receptor ligand III splice variant 2.


572 


92 


801 


L00073 


Homo sapiens 


renin 


1913 


93 


802 


P92219 


Homo sapiens
(human) 


CKJ protein. 


11963


97 


803 


X15357 


Homo sapiens 


ANP-A receptor preprotein (AA -32 to 1029) 


5199 


98 


804 


W64473 


Homo sapiens


Human secreted protein from clone ECl 72 1. 


4018 


95 


805 


AJ243874 


Homo sapiens 


oligophrenin-4


2067 


100 


806 


G01731 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5812.


284 


100 


807 


Z24680 


Homo sapiens 


garp


1562 


83 


808 


AF171669 


Homo sapiens 


glycoprotein-associated amino acid transporter 
LAT2 


1364 


90 


809 


W70321 


Homo sapiens


Secreted protein CC198 1. 


1154 


96 


810 


W74843 


Homo sapiens 


Human secreted protein encoded by gene 115
clone HOVBA03. 


855 


99 


811 


AF108831 


Homo sapiens 


K-Cl cotransporter 3


4561 


100 


812 


AF092135 


Homo sapiens 


PTD014 


862 


100 


813 


AF2X3772 


Homo sapiens 


similar to Homo sapiens ribosomal protein L10
encoded by GenBank Accession Number
L25899 


784 


100 


814 


G01563


Homo sapiens


Human secreted protein, SEQ ID NO: 5644.


330 


100 


815


AF051151 


Homo sapiens 


Toll/interleukin-1 receptor-like protein 3


3850 


99 


816 


W95630 


Homo sapiens 


Homo sapiens secreted protein gene clone
gnll4_l. 


358 


100 


817 


G01082 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5163. 


549 


100 


818 


AF151800 


Homo sapiens


CGI-41 protein


1106 


95 


819 


L00352 


Homo sapiens 


low density lipoprotein receptor 


3980 


100 


820 


X04434 


Homo sapiens


IGF-I receptor 


5832 


99 


821 


G03844 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7925. 


572 


100 


822 


AF212220 


Homo sapiens 


TERK 


396 


48 


823 


Y50125 


Homo sapiens


Human glycophosphatidylinositol-anchored 
protein GPI-12Z 


4897 


99 


824 


AF156778 


Homo sapiens 


ASB-3 protein 


2675 


98 


825 


AF096322 


Homo sapiens 


neuronal voltage-gated calcium channel gamma-
2 subunit


1105 


100 


826 


Y07972 


Homo sapiens 


Human secreted protein fragment #2 encoded 
from gene 28. 


1540 


100 


827 


AB032013 


Homo sapiens 


potassium channel Kv8.1


2433 


95 


828 


Y13620 


Homo sapiens 


BCL9 


5284 


96 


829 


Y91474 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 24 SEQ ID NO:147.


541 


98 


830 


X54232 


Homo sapiens 


glypican


1625 


87 


831 


X14830


Homo sapiens 


acetylcholine receptor beta-subunit preprotein 


2540 


100 


832 


Y71262


Homo sapiens 


Human chondromodulin-like protein, Zdunl.


1002 


100 


833 


G03873 


Homo sapiens


Human secreted protein, SEQ ID NO: 7954. 


638 


96 


834 


AC003030 


Homo sapiens


R29828 1 


1389 


93 


835 


Y38422 


Homo sapiens


Human secreted protein. 


964 


87 


836 


U41557 


Caenorhabditis
elegans


glycine-rich 


85 


36 


837 


AL121889 


Homo sapiens


dJ1076E17.1 (KIAA0823 protein (continues in 
AL023803)) 


998 


75 


838 


AJ011413 


Homo sapiens 


plexin-B1/SEP receptor


1580 


60 


839 


W80398 


Homo sapiens


A secreted protein encoded by clone cwl 543 3.


1105 


67 


840 


G00862


Homo sapiens 


Human secreted protein, SEQ ID NO: 4943. 


255 


92 


841 


G02650


Homo sapiens


Human secreted protein, SEQ ID NO: 6731. 


644 


97 


842 


AF036717 


Homo sapiens


FGFR signalling adaptor SNT-1


2629 


99 


843 


Y73446 


Homo sapiens


Human secreted protein clone yc27_1 protein
sequence SEQ ID NO:114.


1089 


100 


844 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6953. 


357 


69 


845 


AF151810


Homo sapiens


CGI-52 protein


1443 


88 


846 


X83378 


Homo sapiens 


putative chloride channel


1620 


99 


847 


AC004883 


Homo sapiens


similar to general transcription factor 2I; similar to AF038969 (PID:g2827207)


653


96






848 


X99886 


Homo sapiens


monocyte chemotactic protein-2


160 


76 


849 


AC005587 


Homo sapiens 


similar to mouse olfactory receptor 13; similar to P34984 (PID:g464303)


963


98






850


AB038237 


Homo sapiens 


G protein-coupled receptor C5L2


1767 


100 


851


AF124490 


Homo sapiens 


ARF GTPase-activating protein GIT1


3415 


98 


852


Y86217 


Homo sapiens 


Human secreted protein HWHGU54, SEQ ID NO:132.


1189


99






853


AF224741 


Homo sapiens 


chloride channel protein 7


3748 


99 


854


X17094 


Homo sapiens 


furin (AA 1-794) 


3550 


99 


855


W78245


Homo sapiens


Fragment of human secreted protein encoded by gene 19.


1245


99






856


R97569 


Homo sapiens 


Interleukin-2 receptor associated protein p43.


1926 


100 


857


Y41765 


Homo sapiens 


Human PRO1083 protein sequence. 


3211 


99 


858


AF057306 


Homo sapiens 


transmembrane proteolipid


481 


84 


859


AK023116 


Homo sapiens 


unnamed protein product 


374 


69 


860 


Y41312


Homo sapiens


Human secreted protein encoded by gene 5 clone HLDRM43.


824


100






862 


Y23776 


Homo sapiens 


Human secreted protein encoded from gene 66. 


895 


99 


863 


Y74188 


Homo sapiens 


Human prostate tumor EST fragment derived protein #375.


96


30






864 


AF167473 


Homo sapiens 


heme-binding protein 


870 


99 


865


G02532


Homo sapiens


Human secreted protein, SEQ ID NO: 6613. 


211 


67 


866 


X54870 


Homo sapiens 


Type II integral membrane protein


1201 


100 


867 


G00700 


Homo sapiens


Human secreted protein, SEQ ID NO: 4781. 


640 


99 


868 


Y07894 


Homo sapiens


Human secreted protein fragment encoded from gene 43.


388


88






869 


J00123 


Homo sapiens


preproenkephalin (


1349 


93 


870 


Y91632 


Homo sapiens 


Human secreted protein sequence encoded by gene 25 SEQ ID NO:305.


1048


98






871 


L04311 


Homo sapiens 


GABA-alpha receptor beta-3 subunit


237 


93 


872 


Y29988 


Homo sapiens 


Human cytokine family member EF-7 protein.


960 


94 


873 


AF161382 


Homo sapiens 


HSPC264 


1124 


99 


874 


G03412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7493.


464 


100 


875 


Y27572 


Homo sapiens 


Human secreted protein encoded by gene No. 6. 


573 


96 


876 


M13530 


Homo sapiens


B-cell growth factor 


171 


36 


877 


W63681 


Homo sapiens


Human secreted protein 1. 


1652 


99 


878 


L27867 


Rattus norvegicus


neurexophilin


1448


98








879 


Y10835 


Homo sapiens 


Amino acid sequence of a human secreted protein.


321


100






880 


W88991 


Homo sapiens 


Polypeptide fragment encoded by gene 144. 


936 


100 


881 


AF118670


Homo sapiens 


orphan G protein-coupled receptor


1971 


100 


882 


AF20S86S 


Homo sapiens 


EDRF 


328 


100 


883 


Y18462 


Homo sapiens 


cathepsin L


209 


72 


884 


Y94950 


Homo sapiens 


Human secreted protein clone dh1073_12 protein sequence SEQ ID NO:106.


348


100






885 


AF070661 


Homo sapiens 


HSPC005 


404 


100 


886 


Y04313 


Homo sapiens 


Human secreted protein encoded by gene 23. 


385 


100 


887 


X92744 


Homo sapiens


hBD-1


375 


100 


888 


Y22496 


Homo sapiens 


Human secreted protein sequence clone cn621_8.


994


94






889 


Y41293 


Homo sapiens 


Human soluble protein ZTMPO-1.


4595


99 


890 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7795. 


147 


63 


891 


AF208856


Homo sapiens 


BM-014 


1012 


99 


892 


U29195 


Homo sapiens


neuronal pentraxin II


2002 


98 






Homo sapiens 


Burkitt lymphoma receptor 1




100 


894 


Y94914 


Homo sapiens 


Human secreted protein clone pw337_6 protein sequence SEQ ID NO:34.


537


100






895 


W61630


Homo sapiens 


Clone HNFGW06 of EGFR receptor family.


326 


63 


896 


M24110


Homo sapiens 


G0S19-2 peptide precursor


481 


100 


897 


Z68747 


Homo sapiens 


imogen 38


2018 


99 


898 


AF186112 


Homo sapiens


neurokinin B-like protein ZNEUROK1


619 


100 


899 


AF223420 


Homo sapiens 


AD025 


734 


100

900 


P60657 


Homo sapiens


Sequence of human lipocortin.


1835 


100 


901 


M27288 


Homo sapiens


oncostatin M


1297 


99 


902 


W85737 


Homo sapiens


Polypeptide with transmembrane domain. 


749 


100 


903 


G01349


Homo sapiens


Human secreted protein, SEQ ID NO: 5430.


650 


99 


904 


Y00261 


Homo sapiens


Human secreted protein encoded by gene 4. 


1133 


99 


905 


AF039688 


Homo sapiens


antigen NY-CO-3 


771 


99 


906 


AB007836 


Homo sapiens 


Hic-5 


2544 


100 


907 


AB017507 


Homo sapiens 


Apgl2 


224 


100 


908 


AK000056


Homo sapiens


unnamed protein product 


1537 


98 


909 


Y86299 


Homo sapiens


Human secreted protein HFOXB55, SEQ ID 
NO:214. 


427 


100 


910 


AF231023


Homo sapiens


protocadherin Flamingo 1


7393 


99 


911 


Y14134


Homo sapiens


Vascular endothelial cell growth inhibitor beta
protein sequence.


1319 


100 


912 


Z90420 


Homo sapiens 


Human GDF-3 (hGDF-3) polypeptide encoding
cDNA.


1950 


100 


913 


Y19757


Homo sapiens


SEQ ID NO 475 from WO9922243.


1361 


100 


914 


G03172


Homo sapiens


Human secreted protein, SEQ ID NO: 7253.


112 


48 


915 


U14971


Homo sapiens


ribosomal protein S9


886 


90 


916 


AF172854


Homo sapiens


cardiotrophin-like cytokine CLC


1204 


99 


917 


AC005525 


Homo sapiens 


F22162 1 


1963 


100 


918 


AF166350 


Homo sapiens 


ST7 protein 


4711 


99 


919 


Y87285 


Homo sapiens 


Human signal peptide containing protein HSPP-
62 SEQ ID NO:62.


430 


100 


920 


Y36131 


Homo sapiens


Human secreted protein #3. 


465 


88 


921 


AF193766 


Homo sapiens 


cytokine-like protein C17 


724 


100 


922 


Y95013 


Homo sapiens


Human secreted protein vo48 1, SEQ ID NO:66. 


357 


100 


923 


X75208 


Homo sapiens 


protein tyrosine kinase receptor


5256 


100 


924 


y962(J2 




IksfmaB kinivK bindirur nmtein_ Y2H5fi 


813 


98 




AR039886 


Homo sapiens


Hnum-rpmlntMl in cratdrtc Rflnnpr 

wgUlCUSfxJ All guSUlV wCUIV^m 


785 


78 


926


G03368




Human secreted protein, SEQ ID NO: 7449.


55 


SO 


927 


V48606 




Human hres^it tiimrnir-n^KnciatsI nmtein (^1 


539 


100 


928 


Y36151 


Homo sapiens 


Human secreted protein #23. 


668 


100 


929 


AF110399


Homo sapiens 


elongation factor Ts 


1^ 


100 


930 


AF210317 


Homo sapiens 


facilitative glucose transporter family member
GLUT9


2763 


99 


931 


Y73328 


Homo sapiens 


HTRM clone 082843 protein sequence: 


931 


100 


932 


G01959


Homo sapiens 


Human secreted protein, SEQ ID NO: 6040. 


274 


100 


933 


XJ47924 


Homo sapiens 


B-cell receptor associated protein


1469 


100 


934 


G03827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7908. 


529 


93 


935 


An03Q371 




mifnchnnftnnl ARf^ trnncnnrt'n* ^ 
luiLuvAiuiiui itu r\x>\^ uauauuALbi w 


196 


63 


936


X56385


Canis


rab8


1064 


100 


937 


B08906 


Homo sapiens


Human secreted protein sequence encoded by 
gene 16 SEQ ID NO:63. 


117 


44 


938 


M13692 


Homo sapiens 


alpha-1 acid glycoprotein precursor


1064 


99 


939 


Y53886 


Homo sapiens


A suppressor of cytokine signalling protein
designated HSCOP-6. 


515 


42 


940 


Y16630 


Homo sapiens 


Human Putative Adrenomedullin Receptor
(PAR). 


1904 


99 


941 


AC005102 


Homo sapiens


small inducible cytokine subfamily A member
24 


627 


99 


942 


M12SS6 


Homo sapiens 


T-cell receptor beta chain


1289 


81 


943 


AF226046 


Homo sapiens 


GK003 


1049 


98 


944 


Y36078


Homo sapiens


Extended human secreted protein sequence, SEQ
ID NO. 463. 


667 


100 


945


M22877


Homo sapiens


cytochrome c 


565 


100 


946 


W67869 


Homo sapiens


Human secreted protein encoded by gene 63
clone HHGDB72. 


551 


93 


947 




Homo sapiens


Human secreted protein encoded by gene 53
clone HBMCL41.


283 


100 


948 


W85726 


Homo sapiens


Novel protein (Clone BG33_7). 


789 


100 


949 


AJ242015 


Homo sapiens


eMDC II protein


4236 


100 


950 


G04075 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8156. 


567 


99 







951


AF110645


Homo sapiens


candidate tumor suppressor p33 ING1 homolog


1314 


100 


952 


Y36111 


Homo sapiens 


Extended human secreted protein sequence, SEQ
ID NO. 496. 


402 


70 


953 


AB012109 


Homo sapiens 


APC10




ion 


954 


AF246221 


Homo sapiens 


transmembrane protein BRI


1405 


100 


955 


AF054986 


Homo sapiens 


putative transmembrane GTPase


1 o<u 


ion 


956 


W74726 


Homo sapiens


f-Tiimnn ^ntirtMf mTit'nn foQdQ ^ 

lliUUIdU dWiwfcVU l/lUlwUI X^TY^ Jt 


lo/7 


inn 


957 


Y27096 


Homo sapiens 


Human viral recemtnr nmtetn fACTVRP^ 


1 JOi 


inn 


958 


AJ222967 


Homo sapiens


cvcfino^in 


1070 




959 


Y53052 


Homo sapiens 


Human secreted protein clone df202_3 protein 
sequence SEQ ID NO:110.


587 


100 


960 


G02694 


Homo sapiens


Human secreted protein, SEQ ID NO: 6775.




inn 


961 


AF151855 


Homo sapiens


CGI-97 protein




TO 


962 


U26592 




uiuocuRf mcliiup 1 Quiouijugcn 




AK 


963 








J /yo 


lUU 


964 


AF078859 


Homo sapiens


Frooo4 




inn 


965 


AB020315 


nuiiiu Mtpidis 


uUlUUiUgUB Ui inOIUwS QKA'i gdiC^ACCff 


1 AtUi 


inn 
lull 


966 


X04571 


Homo sapiens 


precursor polypeptide (AA -22 to 1185)


6580 


99 






Homo sapiens


hepatocellular carcinoma antigen gene 520




OA 


968 




TlUUMv oUpiCOS 


minA^-icJaiCu pepuuc i, iviuu^i 




iUU 






Homo sapiens 


membrane-type-5 matrix metalloproteinase


3545


100






Homo sapiens


cyclin L ania-6a


157!* 


100 


971 




Homo sapiens


N.-V'i couanspoTOr AJL>L>f 


5oZl 


99 


972 


AF083248 


Homo sapiens


ribosomal protein L26 homolog 


739 


100 


973




Homo sapiens


hyperpolarization-activated cyclic nucleotide-
gated cation channel hHCN4


6295 


100 




Wololy 


Homo sapiens


Clone HTPEFoo of TM4SF superfamily.


454 


100 


975




Homo sapiens 


zinc finger protein NY-REN-21 antigen


2261 


100 


976


A1?0'7CQ>IQ 

/VrZ/Oy'lo 


Homo sapiens




11763 


99 


977 


AB026891


Homo sapiens


cystine/glutamate transporter


2552 


100 


978 


AF117657


Homo sapiens


thyroid hormone receptor-associated protein
complex component TRAP80


3348 


99 




AT?n>tjioni 


Rattus 
norvegicus


neural membrane protein 35; NMP35 


1570 


92 




API 1Q90T 


Homo sapiens


neuroendocrine-specific protein-like protein 1


1170 


99 


981




Homo sapiens


potassium channel modulatory factor


1>S3 


99 


982 




Homo sapiens


Human stomach carcinoma clone HP10412-
encoded protein.




GO 






Homo sapiens


interferon regulatory factor 3


ZUIZ 


yo 


984




Homo sapiens 




f 1 cn 


100 


985 


Y14482


Homo sapiens


Fragment of human secreted protein encoded by 
gene 17. 


172 


70 


986 






iT'tdicnioniiM? rec^)u>r k-rV^ivf 




100 

lUU 


987 


W27291 


Homo sapiens 


Human H1075-1 secreted protein 5' end. 


712 


100 


988 




scxt& 


Juvenile hormone esterase binding protein




52. 


989 


G03697 


Homo sapiens


Human secreted protein, SEQ ID NO: 7778.


1 0A 


oo 


990 


AF204159 


Homo sapiens


pvuuoiuui tosjffs cunuuwUiuu; Caiciuizi— acuvmcu 

channel beta 3a subimit 




inn 


991 


G02061 




XlUHIau SCCIdCu piUiCU], OCV^ \U INw. OlHiM 




QO 


992 


AI 031266 


Caenorhabditis

elegans


VMIfV^ 1 
V mi uQxv. 1 


J^/ 


•HI 


993 


Y66749 


Homo sapiens


inciiiurtuii^iHnuiu piuusui f i\.v/i l^t. 


AT?ft 


CIQ 


994 


G01246 


Homo sapiens


HlimAn «m<wfm4 nml-^in QPO W\ UO* VX'Yl 

numaii scmCICU piuLCu]] ocv^ r%\J» jj^i* 


1A1 


/ / 


995 


AF133845 


Homo sapiens


conn 


^Xl 1 

JOl i 




996 


AF117756


Homo sapiens


thyroid hormone receptor-associated protein
complex component TRAP150


4999 


100 


997 


W62066 


Homo sapiens 


Human stem cell antigen 2. 


284 


93 


998 


Y87173 


Homo sapiens 


Human secreted protein sequence SEQ ID
NO:212.


725 


100 


999 


Y13379 


Homo sapiens 


Amino acid sequence of protein PRO263.


1654 


99 


1000 


Y95008 


Homo sapiens 


Human secreted protein vG J. SEQ ID NO:56. 


676 


47 


1001 


AF190167 


Homo sapiens


membrane associated protein SLP-2 


1747 


100 









Homo sapiens


riumaii secTnea proieiii, otCA^ lu ri\j. dj id. 




it* 


1003 


W73420 


Homo sapiens 


Human secreted protein encoded by Gene No. 


2150 


100 


1004 


X12791 


Homo sapiens 


19kD SRP-protein (AA 1-144)


742 


100 




MZ33Z3 


Homo sapiens 


membrane protein


642 


100 


1006


AW745 


Homo sapiens 


KUcL receptor 


326 


98 


1007


Y35997 


Homo sapiens 


Extended human secreted protein sequence, SEQ 

ID NO. 382.


824 


99 


1008




¥Tl ll 11 ll Hl ■!! 

ciyioDaLes 
moloch 


dopamine receptor D4


92 


35 




iVlooO 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 81 SEQ ID NO:333.


1372 


99 


1010


ATI 'i£t^< 


Homo sapiens 


dJ304B14.1 (novel protein)


825 


98 


1011


G03733 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7814. 


379 


98 


1012


Y17531 


Homo sapiens 


Human secreted protein clone BL205 14 protein. 


818 


97 


1013 


G00724 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4805.


462 


100 


1014 


AF288092 


Naegleria 
gruberi


haem lyase 


114 


37 


1015


AB045292 


Homo sapiens 


M83 protein 


3867 


99 


1016


X15940 


Homo sapiens 


ribosomal protein L31 (AA 1-125)


644 


100 


1017 


Y94873 


Homo sapiens 


Human protein clone HP02632. 


1876 


100 


1018 


AL024498 


Homo sapiens 


dJ417M14.1 (novel protein)


589 


100 


1019 


X83425 


Homo sapiens 


Lutheran blood group glycoprotein 


3054 


99 


1020 


W03516 


Homo sapiens


Prostaglandin DP receptor. 


1864 


100 


1021 


G03960


Homo sapiens 


Human secreted protein, SEQ ID NO: 8041. 


398 


100 


1022 


Y91689


Homo sapiens 


Human secreted protein sequence encoded by 
gene 93 SEQ ID NO:362. 


768 


100 


1023 


AE000660 


Homo sapiens


hADV36S1


573 


100 


1024 


AF132965


Homo sapiens


CGI-31 protein


1550 


100 


1025 


W92380 


Homo sapiens 


Human I'K-intBracting protein S1038. 


1466 


97 


1026 


R66278 


Homo sapiens


Therapeutic polypeptide from glioblastoma cell
line. 


830 


100 


1027 


X65614


Homo sapiens


S100P calcium-binding protein


476 


100 


1028 


Y41741 


Homo sapiens 


Human PRO704 protein sequence. 


1323 


100 


1029 


AJ001014 


Homo sapiens 


RAMP1


806 


100 


1030


W63682


Homo sapiens


Human secreted protein 2. 


1354 


99 


1031 


AK023007 


Homo sapiens


unnamed protein product 


766 


100 


1032 


W97900 


Homo sapiens 


Human SR-BI class B scavenger.


2672 


99 


1033 


Y82453 


Homo sapiens 


Human TGC-440 secretory protein SEQ ID
NO:1.


639 


99 


1034 


Y73473 


Homo sapiens


Human secreted protein clone ydl78_l protein 
sequence SEQ ID NO: 168. 


752 


93 


1035 


Y86468 


Homo sapiens 


Human gene 48-encoded protein fragment, SEQ
ID NO:383.


96 


90 




UUVoli 


Homo sapiens 


mitochondrial ATP synthase subunit 9 precursor


698 


100 






Homo sapiens 


calpain 


3699 


99 


1038


A0O4U3 


Homo sapiens


acetylcholine receptor epsilon subunit CHRNE 


OCT* 

ZD/4 


1 AA 
lUU 


1039 


AJ242730 


Homo sapiens 


polyhomeotic 2


1310 


100 


1040


AF169968 


Mus 

musculus 


DNA binding protein DESRT


1453 


80 






Bos taurus


ETT = ' = 

permeability increasing protein


383 


79 


1042 


G00368


Homo sapiens 


Human secreted protein, SEQ ID NO: 4449. 


75 


50 


1043 


G02532


Homo sapiens 


Human secreted protein, SEQ ID NO: 6613. 


60 


53


1044


K494582 


Homo sapiens 


interleukin 8 receptor B


1850 


100 


1045 


AL080239 


Homo sapiens 


bG256O22.1 (similar to IGFALS (insulin-like
growth factor binding protein, acid labile

subunit))


1704 


50 


1046 


AF125101 


Homo sapiens 


HSPC040 protein 


580 


100 


1047 


W74809 


Homo sapiens 


Human secreted protein encoded by gene 81
clone HMWDN32. 


176 


100 


1048 


AL022238 


Homo sapiens


dJ1042K10.4 (novel protein)


2201 


100 


1049


W88667 


Homo sapiens 


Secreted protein encoded by gene 134 clone
HAIBP89. 


1559 


99 


1050 


AF097518 


Homo sapiens


liver-specific transporter 


2820 


100 





1051


W78324 


Homo sapiens 


Fragment of human secreted protein encoded by gene 81.




OR






1052


Y21851


Homo sapiens


Human signal peptide-containing protein (SIGP) (clone ID 2328134).


1643






1053


AL163815


Arabidopsis thaliana


putative protein


661


ox








1054 


Y76200 


Homo sapiens 


Human secreted protein encoded by gene T7




inn 


1055 


AI276557 


Homo sapiens 


TC10-like Rho GTPase


1 

1 lOU 


inn 


1056 


Y27620 


Homo sapiens 


Human secr^ed orateln encnd^ hv apnp Mn ^A. 


154 




1057 


D14530 


Homo sapiens 


ribosomal protein


/*rJ 


inn 


1058 


AF132000 


Homo sapiens 


TADA1 protein






1059 


AL031778 


Homo sapiens






mo 




















Isoquinoline-binding protein)) LIKE protein)






1060 


AF227135 


Homo sapiens


candidate tnjd'e reoantor T*7ttQ 






1061


Y27575 




rfiiirmn o^<*'r^t^/1 nmti^in t*\\r€\Ai^\ Irtr non* O 
flUlliOU drCbiClCU piUtCiU f^CUUCu gCZiC SAQ, 7. 




1 oo 


1062 


211697 






iUoo 


1 AO 

lUu 


1063 


AF123757




putative transmembrane protein


Old 


1 AO 




API ^Sl'^^ 


Homo sapiens 


novel retinal pigment epithelial cell protein 


2932 


99 




'WA.'XfnA 


Homo sapiens


Human channel-related molecule HCRM-2.


936 


99 


1066 




Homo sapiens 


Rab3 GDP/GTP exchange factor homologue


2575 


100 




I JOUo / 


Homo sapiens 


Extended human secreted protein sequence; SEQ 


770 


85 














1068 





Homo sapiens 


Human secreted protein clone mc300_1 protein


301 


100 














1069 


Y94959 


IaUIUU aUIflcUa 


nuiiiaD acwctca pruicul citxie mwW l prooem 


JUl 


100 














1070 


W64535




nuiiicm icui^oifjic ecu uonc nxAiuoii^ proKllL 




oo 


1071 


X03145 




tvit III 

pui. l^iU* JXl 






1072 


AL031177 




ujoo7A^u.j ^novei pioiein^ 


vol 


91 


1073 


X82200 


Unrvm cam ona 






62 


1074 


G03213 




l-liimnn < i^l^jl nvntMn CC/~i TF) XI^* '70Q/1 

nujuaii acucLCu pruLCUi, oEi\^ inv-f . i£y*^. 




47 


1075 


Y36S3 


Hninn cnnifMic 
jnuiiiu dtt^icio 


nuiiiaii acucLcu pruicui cucoacQ Djr gene lu. 


DUO 


55 


1076 


G03187




Hiimnn CAnr^Ml nmt^tn 1 1 1 "Klf^- T7A9 
nuiiiau acucicu piuiCiUj OCA^ iLi idm\>Or 






1077 


L25899 


Homo sapiens


ribosomal protein L10




76 


1078 


Y91447 




nujiiaii scmctoq piuKciu sequence cncouco uy 


OOQ 


97 














1079 


G01862 




Humnn OMm^'eri nrrttpin ^P/~l 111 "Mn* ^Qd.'7 

xiuiiHUi acudJCU |/j\iicuiy Ha' i^W* 




oo 


1080 


AB039723 


Hnmn RnntpnQ 






oo 


1081 


AB020527 


Hmnn cnnirac 

M,M.\/\M\t Ml^l&lld 




'>A0 


1/V) 

lUU 


1082 


L13802


Hnirin onnipne 


rilvtcmnl nmt^n email ciihimtf 




on 
oU 


1083 


W75098 


Hnmn onnimQ 


HllTYlftn CM^ffiPfM^ nmtpin Mirwlml Kv a^n» 
nuuiiui a^JCtPU ^juicui cuujucu Djr gCUC *^£m 




fil 








clone HSXBI25. 






1084 


G03564 


Homo sapiens


Human secreted protein, SEQ ID NO: 7645.


OJ 


CI 

J t 


1085 


G04063 


Homo sapiens


Human secreted protein, SEQ ID NO: 8144.


DO 


A'X 


1086 


AF090942


Homo sapiens 


PRO0657 


124 




1087 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4598.


129 


41 


1088 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 8172.


IM 

t AkO 


JO 


1089 


AF140631


Homo sapiens


Vl^llJ vVUUluU Xv^^^U^UX X*T 




oo 
ox 


1090 


G04063 


Homo sapiens


Human secreted protein, SEQ ID NO: 8144.


1 1 A 
1 il 




1091 


S72304 


1^11 (| on 






<M 

aJ 


1092 


W88708 








lUU 








HEMAM41.- 






1093 


W85612 


Homo sapiens 


Secreted Drotein clone fhl 23 ^ 


4?^R 




1094 


Y53012 


Homo sapiens


Hiimnn I'pff nrntyin dnncnmSIA ^ Tvntt^n 

X JUIIITIII M^VIWtbU JllUUflU WXUiXb Lilll^X^ ^ UlUtClXl 


lUl J 


oo 
yy 








crouennR SPO TD NO*30 






1095 


Y92345 


Homo sapiens 


Human cancer associated antigen precursor from 


409 


100 








cl0DeNY-REN-6i 






1096 


AF090942 


Homo sapiens 


PRO0657 


147 


60 


1097 


L24521


Homo sapiens


transformation-related protein


166 


58 


1098 


X56932 


Homo sapiens 


23 kD highly basic protein 


490 


70 


1099 


G04063 


Homo sapiens


Human secreted protein, SEQ ID NO: 8144.


83 


35 


1100 


Y02693 


Homo sapiens


Human secreted protein encoded by gene 44 clone HTDAD22.


149


59


1101 


AF119851


Homo sapiens


PRO1722


183 


72 


1102 


G04086 


Homo sapiens


Human secreted protein, SEQ ID NO: 8167. 


207 


62 


1103 


G04063 


Homo sapiens


Human secreted protein, SEQ ID NO: 8144. 


91 


52 


1104 


X74856 


Mus 

musculus


ribosomal protein L28 


128 


69 


1105 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7870. 


130 


62 


1106 


G03133 


Homo sapiens


Human secreted protein, SEQ ID NO: 7214. 


122 


48 


1107 


G03040 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7121. 


69 


43 


1108 


AF039942 


Homo sapiens


HCF-binding transcription factor Zhangfei


744 


99 


1109 


AF201951 


Homo sapiens


high affinity immunoglobulin epsilon receptor
beta subunit


738 


94 


1110 


AF111108


Mus

musculus


transient receptor potential 2 


223 


79 


1111


AF119900


Homo sapiens


PRO2822


144 


59 


1112 


Y16589 


Homo sapiens


A protein that interacts with presenilins. 


265 


39 


1113 


G02872 


Homo sapiens


Human secreted protein, SEQ ID NO: 6953. 


178 


67 


1114 


Y02999 


Homo sapiens 


Fragment of human secreted protein encoded by
gene 121. 


164 


63 


1115


Y30811 


Homo sapiens 


Human secreted protein encoded from gene 1.


1217 


99 


1116 


X51394 


Xenopus
laevis 


APEG precursor protein 


130 


40 


1117 


M27826 


Homo sapiens


neutral protease large subunit 


442 


65 


1118 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7452.


72 


60 


1119 


G03602


Homo sapiens


Human secreted protein, SEQ ID NO: 7683. 


491 


97 


1120 


Y35906 


Homo sapiens 


Extended human secreted protein sequence, SEQ
ID NO. 155. 


244 


97 


1121 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7795.


122 


^5 


1122 


Y00337 


Homo sapiens 


Human secreted protein encoded by gene 81.


no 


90 


1123 


AF084830 


Homo sapiens 


two pore domain K+ channel; TASK-2 


703 


94 


1124 


AF212862 


Homo sapiens


membrane interacting protein of RGS16 


442 


88 


1125 


W64469 


Homo sapiens 


Human secreted protein from clone CW79S 2. 


191 


53 


1126 


G01361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5442. 


154 


100 


1127 


G01361 


Homo sapiens


Human secreted protein, SEQ ID NO: 5442. 


165 


100 


1128 


Y84320 


Homo sapiens


Human cardiovascular system associated protein 
kinase-1. 


815 


99 


1129 


G02105


Homo sapiens


Human secreted protein, SEQ ID NO: 6186.


88 


73 


1130 


Y32923 


Homo sapiens 


Transmembrane domain containing protein clone
HP01512.


700 


100 


1131 


Y29817 


Homo sapiens 


Human synapse related glycoprotein 2.


260 


91 


1132 


Y91644 


Homo sapiens 


Human secreted protein sequence encoded by 
gene 43 SEQ ID NO:317.


525 


96 


1133 


Y91449 


Homo sapiens 


Human secreted protein sequence encoded by
gene 49 SEQ ID NO:170.


542 


100 


1134 


AB017908 


Homo sapiens 


4F2 light chain 


2399 


93 


1135 


X51760 


Homo sapiens


zinc finger protein (583 AA)


312 


55 


1136 


Y99426 


Homo sapiens 


Human PRO1604 (UNQ785) amino acid
sequence SEQ ID NO:308.


917 


72 


1137 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7871. 


102 


50 


1138 


AF155106 


Homo sapiens


NY-REN-36 antigen 


768 


91 


1139 


AL031055 


Homo sapiens 


dJ28H20.1 (novel protein similar to membrane
transport proteins) 


117 


50 


1140 


AF011355> 


Bos taurus


regulator of G-protein signaling 7


138 


96 


1141 


Y70018 


Homo sapiens 


Human Protease and associated protein-12
(PPRO-12). 


623 


100 


1142 


G04091


Homo sapiens


Human secreted protein, SEQ ID NO: 8172. 


113 


38 


1143 


AB03023S 


Canis 
familiaris


D4 dopamine receptor 


89 


48 


1144 


Y94922 


Homo sapiens


Human secreted protein clone pv6_1 protein
sequence SEQ ID NO:50.


539 


88 


1145 


XSW62 


Homo sapiens 


rab-related GTP-binding protein


398 


96 


1146 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7888.


168 


79 


1147 


G03712 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7793.


512 


85 


1148 


Y28279 


Homo sapiens 


Human G-protein coupled receptor GRIR-1. 


705 


76 


1149 


U13642 


Caenorhabditis elegans


exon 5 similar to transmembrane domain of S. cerevisiae zinc resistance protein


247


36






IISO 


G03438


Homo sapiens 


Human secreted protein, SEQ ID NO: 7519. 


117 


62 


1151


G01003 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5084. 


181


80 


1152


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7879. 


198 


63 


1153


X88799


Oryza sativa


DNA binding protein 


95 


41 


1154


D85245


Homo sapiens 


TR3beta 


155 


96 


1155 


R74272 


Homo sapiens 


Tumour suppressor protein, p53. 


341 


87 


1156 


Y86265 


Homo sapiens 


Human secreted protein HUSXE77, SEQ ID
NO:180. 


99 


41 


1157


G02577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6658. 


263 


98 


1158 


AF104334 


Homo sapiens 


putative organic anion transporter 


185 


42 


1159 


G01393 


Homo sapiens


Human secreted protein, SEQ ID NO: 5474.


173 


57 


1160 


W75771 


Homo sapiens 


Human GTP binding protein APD08.


224 


81 


1161 


AF216833


Homo sapiens 


M-ABC2 protein 


410 


83 


1162 


W67816 


Homo sapiens


Human secreted protein encoded by gene 10 
clone HCEMU42. 


1156 


100 


1163 


AF119851


Homo sapiens


PRO1722


230 


70 


1164 


Y87252 


Homo sapiens 


Human signal peptide containing protein HSPP-29
SEQ ID NO:29.


113 


31 


1165 


W64537 


Homo sapiens 


Human liver cell clone HP01148 protein.


338 


82 


1166 


AF269286 


Homo sapiens 


HC6 


134 


64 


1167 


Y14482 


Homo sapiens


Fragment of human secreted protein encoded by 
gene 17. 


149 


51 


1168 


D90789 


Escherichia
coli


Dipeptide transport system permease protein 
DppC. 


411 


90 


1169 


R63783 


Homo sapiens 


TG0847 protein. 


344 


90 


1170 


Y45274 


Homo sapiens 


Human secreted protein encoded from gene 18.


478 


98 


1171 


D64154 


Homo sapiens 


Mr 110,000 antigen


347 


96 


1172 


AB026256 


Homo sapiens 


organic anion transporter OATP-B 


311 


67 


1173 


G00357 


Homo sapiens


Human secreted protein, SEQ ID NO: 4438. 


60 


52 


1174 


D87717 


Homo sapiens 


similar to human GTPase-activating
protein (A49869)


178 


59 


1175 


M64716 


Homo sapiens 


ribosomal protein


391 


78 


1176 


R08330 


Homo sapiens 


Human IL-7 receptor clone H6. 


285 


67 


1177 


L06505 


Homo sapiens


ribosomal protein L12 


242 


72 


1178 


AJ251885 


Homo sapiens 


organic cation transporter (OCT2) 


276 


88 


1179 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7339.


155 


71 


1180 


G01207 


Homo sapiens


Human secreted protein, SEQ ID NO: 5288. 


282 


90 


1181 


AF181556


Rattus
norvegicus


tRNA selenocysteine associated protein 


249 


62 


1182 


AF161524 


Homo sapiens 


HSPG176 


138 


90 


1183 


G03789 


Homo sapiens


Human secreted protein, SEQ ID NO: 7870. 


282 


66 


1184 


Y02671 


Homo sapiens 


Human secreted protein encoded by gene 22 
clone HMSJW18. 


107 


71 



1185 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7878.


88 


69


1186 


G03564 


Homo sapiens


Human secreted protein, SEQ ID NO: 7645. 


118 


46 


1187 


AB032905


Hylobates 
concolor 


dopamine receptor D4


96 


37 


1188 


G00956 


Homo sapiens


Human secreted protein, SEQ ID NO: 5037.


292 


78 


1189 


G03258


Homo sapiens


Human secreted protein, SEQ ID NO: 7339. 


178 


79 


1190 


G03361 


Homo sapiens


Human secreted protein, SEQ ID NO: 7442.


324 


76 


1191 


AF117755


Homo sapiens


thyroid hormone receptor-associated protein
complex component TRAP230 


187 


70 


1192 


Y70455 


Homo sapiens


Human membrane channel protein-5
(MECHP-5).


202 


67 


1193 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7133. 


99 


42 


1194 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 6688.


192 


76 


1195 


W25»661 


Homo sapiens 


Homo sapiens CI542_2 clone secreted protein.


2001 


98 


1196 


Y14104 


Homo sapiens 


Human GABAB receptor Id protein sequence. 


239 


69 


1197 


X61972 


Homo sapiens 


macropain subunit iota


149 


90 


1198 


G00534 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4615. 


145 


51 


1199 


Y86260 


Homo sapiens


Human secreted protein HELHN47, SEQ ID 
NO:175. 


1089 


89 


1200 


G02607 


Homo sapiens


Human secreted protein, SEQ ID NO: 6688. 


154 


57


1201 


G00838 


Homo sapiens 


Human secreted protein, SEQ ID NO: 4919.


404 


50 


1202 


M27826 


Homo sapiens 


neutral protease large subunit


202 


49 


1203 


Y73424 


Homo sapiens 


Human secreted protein clone yi4_1 protein
sequence SEQ ID NO:70. 


265 


61 


1204 


AF264014 


Homo sapiens 


scavenger receptor cysteine-rich type 1 protein
M160 precursor 


625 


98 


1205 


Y36203 


Homo sapiens 


Human secreted protein #75.


219 


59 


1206 


U78111 


Gallus gallus


AQ 


205 


57 


1207 


AF095448 


Homo sapiens


putative G protein-coupled receptor


416 


76 


1208 


AF116715


Homo sapiens 


PRO2829


127 


75 


1209 


AF099137 


Homo sapiens 


MaxiK channel beta 2 subunit 


475 


95 


1210 


AF205718 


Homo sapiens 


hepatocellular carcinoma-related putative tumor
suppressor 


423 


79 


1211 


Y27868 


Homo sapiens 


Human secreted protein encoded by gene No.
107. 


224 


70 


1212 


G00719


Homo sapiens 


Human secreted protein, SEQ ID NO: 4800. 


117 


44 


1213 


G01009 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5090.


351 


73 


1214 


AF090942 


Homo sapiens 


PRO0657 


124 


70 


1215


Y14427 


Homo sapiens 


Human secreted protein encoded by gene 17
clone HSIEA14. 


99 


77 


1216 


G03905 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7986. 


173 


57 


1217 


Y57897 


Homo sapiens 


Human transmembrane protein HTMPN-21.


1173 


100 


1218 


J00194 


Homo sapiens 


hla-dr antigen alpha chain


454 


78 


1219 


Y59709 


Homo sapiens 


Secreted protein 76-28-3-A12tFL1.


470 


92 


1220 


W81576 


Homo sapiens 


EBV-induced G-protein coupled receptor (EBI-2)
polypeptide.


725 


100 


1221 


W96745 


Homo sapiens 


High affinity immunoglobulin E receptor-like
protein (IGERB).


650 


98 


1222 


Y35911 


Homo sapiens 


Extended human secreted protein sequence. SEQ 
ID NO. 160. 


135 


31 


1223 


Y00278 


Homo sapiens


Human secreted protein encoded by gene 21.


260 


95 


1224 


AF161422 


Homo sapiens 


HSPC304 


568 


90 


1225 


U14970 


Homo sapiens 


ribosomal protein S5


202 


95 


1226 


G01733 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5814.


610 


100 


1227 


AF099973 


Mus 

musculus


schlafen2 


333 


56 


1228 


G01218 


Homo sapiens


Human secreted protein, SEQ ID NO: 5299. 


155 


81 


1229 


AF217188 


Mus 

musculus 


YIP1B


801 


63 


1230 


AF176813 


Homo sapiens 


soluble adenylyl cyclase 


275 


100 


1231 


X98333 


Homo sapiens 


organic cation transporter 


1704 


100 


1232 


W74955 


Homo sapiens 


Human secreted protein encoded by gene 77 
clone HOEAS24.


212 


53 


1233 


Y94940 


Homo sapiens 


Human secreted protein clone yi62_1 protein
sequence SEQ ID NO:86. 


526 


100 


1234 


U76618 


Mus 

musculus 


N-RAP 


482 


82 


1235 


AF044924 


Homo sapiens 


hook2 protein 


380 


97 


1236 


G01459


Homo sapiens


Human secreted protein, SEQ ID NO: 5540. 


417 


100 


1237 


AF000018 


Homo sapiens


adapter protein 


164 


84 


1238 


W88633 


Homo sapiens 


Secreted protein encoded by gene 100 clone 
HE8EU04. 


250 


90 


1239 


W29660 


Homo sapiens 


Homo sapiens C3I27_1 clone secreted protein.


697 


98 


1240 


AF004161


Oryctolagus
cuniculus


peroxisomal Ca-dependent solute carrier


154 


52 


1241 


Y92710 


Homo sapiens 


Human membrane-associated protein Zsi^4.


709 


97 


1242 


Y95002


Homo sapiens 


Human secreted protein vc34_l, SEQ ID NO:44. 


908 


88 


1243 


Y44905 


Homo sapiens 


Human potassium channel molecule ERG-LP2
partial protein. 


325 


100 


1244 


AF284422 


Homo sapiens 


cation-chloride cotransporter-interacting protein


SU 


97 


1245 


Y53629 


Homo sapiens 


A bone marrow secreted protein designated
BMS115. 


1888 


93 


1246 


AB039371 


Homo sapiens


mitochondrial ABC transporter 3


389 


97 


1247 


Y35911 


Homo sapiens 


Extended human secreted protein sequence, SEQ
ID NO: 160.


168


39


1248 


AF072509


Rattus
norvegicus


glutamate receptor interacting protein 2








1249 


AF247042 


Homo sapiens


tandem pore domain potassium channel TRAAK


661 


98 


1250 


B08974 


Homo sapiens


Human secreted protein sequence encoded by
gene 27 SEQ ID NO: 131.


1087


97






1251 


L15313


Caenorhabditis
elegans


putative


858


59








1252 


Y29338 


Homo sapiens


Human secreted protein clone it217_2 alternate
reading frame protein.


278


75






1253 


W01730 


Homo sapiens


Human G-protein receptor HPRAJ70


211 


92 


1254 


G03074 


Homo sapiens


Human secreted protein, SEQ ID NO: 7155.




Oj 


1255 


G01818


Homo sapiens


Human secreted protein, SEQ ID NO: 5899.


253 


91 


1256 


AF286368 


Homo sapiens


eppuv"! 


999 




1257 


AF220264 


Homo sapiens


MOST-1 


87 


93 






Homo sapiens


xiumaii sccicicu piuicm, os^a^ uj imu. ojUo. 




/o 






Homo sapiens


Human secreted protein fragment #2 encoded


fil 










from gene 26.












Tumor necrosis factor receptor 1 death domain




1 no 
lUU 








Iioanfl fe'Xr^m ^TUA 

iiganu ^cionc d irij. 






1261 






Tin/* m^tnl IftnmtMtcp AT^AXifTCU^ 
Liln* lllCuUlupruiCoaC nUnrix I ou 


179 


jO 






Homo sapiens 


semaphorin V




£.•7 


IZCo 




Homo sapiens


Renal cancer associated antigen precursor


288 


71 








sequence. 








X Jul 33 


,_ , 

Homo sapiens 


Human secreted protein #25. 


187 


80 


12CO 


Y78114 


Homo sapiens 


Human cytokine signal regulator CKSR-2 SEQ 


723 


93 








ID NO:2.






19/^ 


* 1 jjy/ 


Homo sapiens


Amino acid sequence of protein PRO334.


191 


100 


19rfi'7 




Rattus 


phosphatidylinositol 5-phosphate 4-kinase


859 


95 






norvegicus 


gamma 






JxOo 


U/JlO/ 


Homo sapiens 


candidate tumor suppressor gene LUCA-1 


~T<o 


95 


19AQ 




Mus


LMI5K2 


552 


76 






musculus 










ALiUOlojx 


Homo sapiens


jt^K^fWfl 1 ^WnvHHlnn . , |* t\i ii ' Mill i-^lln ■- - — 

uJjvUrj.l (nomologoitneratsyns^ticras 


820 


98 








\j 1 rnjiC-oCQvaiuig proccm pu^ jyuKjnirj 






1971 


rai9i9ft 


. 

Homo sapiens 


Human secreted protein, SEQ ID NO: 6207.


1j1 


oc 


1999 




Homo sapiens 


NADH-cytochrome b5 reductase isoform


25J 


92 




Af tn^/xi 


Homo sapiens 


ajjoov^j I.O (novel Aivu^-oinaing ensyme 


1280 


100 








aUnillu Uj UU^yi''COt3lZ]rIIlC n SyuUlCUlaSC 












(acetate-CoA ligase))








AFn/vl7Afl 
rVr VA?*r / H-O 


"Ti 




^<9^ 
















1275 


D17554 


Homo sapiens


TA"5CRFH1(17 


377 


78 


1276 


Y30715 


Homo sapiens




643 


00 








DTotein 






1277 


AF146760 


Homo sapiens 


septin 2-like cell division control protein


707 


100 


1278 


Y05069 






981 

I 




1279 


X59668 


Oryctolagus


aorta CNG channel (rACNG)


267 


85 






cuniculus








1280 


G01051


Homo sapiens


Human secreted protein, SEQ ID NO: 5132.


4ltQ 


98 


1281 


G03411




Human secreted protein, SEQ ID NO: 7492.






1282 


AF055084








100 


1283 


AF117814


Mus 


oH d-^lciTWUv^ relfltBd 1 nTtifcpin 


357 


98 






musculus








1284 


TI871I8 


Xenopus












laevis 








1285 


AF061346 


Mus 


Edp1 protein


452 


68 






musculus 






1286 


AB030182 


Mus 


contains transmembrane (TM) region 


582 


68 






musculus 








1287 


A13595 


synthetic 


immunosuppressive protein PP15


185 


97 






construct








1288 


AF254411 


Homo sapiens 


ser/arg-rich pre-mRNA splicing factor SR-A1


837 


100 


1289 


AF084205 


Rattus 


serine/threonine protein kinase TAO1


319 


98 






norvegicus


1290 


AF038563 


Homo sapiens 


membrane associated guanylate kinase 2


523 


100 




ArUJ4oj/ 


Homo sapiens 


double-stranded RNA specific adenosine
deaminase 


468 


100 




ml JoOo 


Bos taurus 


endozepine-related protein precursor


937 


87 


1293 


AB010692 


Arabidopsis
thaliana


ATP-dependent RNA helicase-like protein


636 


45 






Homo sapiens 


orphan G-protein coupled receptor 


1570 


100 




Wo782o 


Homo sapiens


Human secreted protein encoded by gene 22 
clone HFEAF41. 


504 


98 




AC0U4832 


Homo sapiens 


similar to 45 kDa secretory protein ; similar to 
CAA10644.1 (PID:g4164418) 


648 


65 


1297


A80U33 


Oryctolagus
cuniculus


cysteine rich hair keratin associated protein 


575 


70 




G02645


Homo sapiens 


Human secreted protein, SEQ ID NO: 6726. 


223 


97 




Y59440 


Homo sapiens 


Human delta3 fragment #4.


122 


32 


1300 


W70504 


Homo sapiens 


Leukocyte seven times membrane-penetrating 
type receptor protein JEOl 8.


459 


81 


1301 


Y67315 


Homo sapiens


Human secreted protein BL89_13 amino acid 
sequence. 


3916 


99 




M77693 


Homo sapiens 


spermidine/spermine N1-acetyltransferase


174 


96 


1303 


G01331


Homo sapiens


Human secreted protein, SEQ ID NO: 5412.


254 


69 


1304


G01491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5572. 


747 


99 


1305 


AF148509 


Homo sapiens


alpha 1,2-mannosidase


602 


98 


1306 


G01658


Homo sapiens 


Human secreted protein, SEQ ID NO: 5739. 


333 


98 


1307 


Y90899 


Homo sapiens


D1-like dopamine receptor activity modifying
protein SEQ ID NO: 1. 


332 


98 


1308 


AF033120 


Homo sapiens 


p53 regulated PA26-T2 nuclear protein


348 


52 


1309 


Y73388 


Homo sapiens 


H i KM clone 3376404 protein sequence. 


147 


66 


1310 


AF063243 


Bos taurus 


ribosomal protein L30 


296 


90 


1311 


AF224494 


Mus 

musculus


arsenite inducible RNA associated protein 


688 


70 


1312 


Y73342 


Homo sapiens 


H i KM clone 2709055 protein sequence. 


1154 


100 


1313 


Y99419 


Homo sapiens 


Human PRO1780 (UNQ842) amino acid 
sequence SEQ ID NO:282.


1145 


78 


1314 


AF116667


Homo sapiens 


PRO1777


433 


97 


1315


W75100 


Homo sapiens 


Human secreted protein encoded by gene 44 
clone HE8CJ26. 


807 


97 


1316 


AJ272078 


Homo sapiens 


APOBEC-1 stimulating protein 


789 


100 


1317 


AB041533 


Homo sapiens


sperm antigen


2607 


98 


1318 


U19617 


Mus 

musculus 


Elf-1 


806 


92 


1319 


U82398 


Escherichia
coli 


ferric enterobactin transport protein


768 


100 


1320 


D90892 


Escherichia
coli 


SORBITOL-6-PHOSPHATE 2- 
DEHYDROGENASE (EC 1.1.1.140) 
(GLUCITOL-6-PHOSPHATE
DEHYDROGENASE) (KETOSEPHOSPHATE 
REDUCTASE). 


709 


100 






Homo sapiens 


Human secreted protein encoded by gene 41
clone HPBCr74. 


601 


92 




/uZ/olUl 


Homo sapiens 


GPRC5B protein


466 


93 




nS£i\}iMl 


Homo sapiens 


GPRC3B protein 


504 


97 






Homo sapiens 


Protein regulating gene expression PROE-21.


1584 


100 




uyijoi 


Rattus 
norvegicus


pyridoxine 5'-phosphate oxidase


1277 


89 




Ar 


Homo sapiens 


NADH-cytochrome b5 reductase isoform


1606 


100 






Homo sapiens 


Human receptor molecule (REC) encoded by 

Incvtcs done ^XlSKld 


1531 


90 


1328 


AF151048 


Homo sapiens


HSPC214 


657 


85 


1329 


Y10530 


Homo sapiens


olfactory receptor


1645 


100 


1330 


AF180681 


Homo sapiens 


guanine nucleotide exchange factor


4314 


99 


1331 


AF111856


Homo sapiens 


sodium dependent phosphate transporter isoform 
NaPi-3b 


3591 


99 


1332 


Y13583 


Homo sapiens


G-protein coupled receptor 


2171 


100


1333 


AF078866 


Homo sapiens 


SURF-4 


1395


100


1334 


Y25755 


Homo sapiens 


Human secreted protein encoded from gene 45.


1380 


96 


1335 


AF152325 


Homo sapiens


protocadherin gamma A5


4742 


99 


1336 


X74070 


Homo sapiens 


transcription factor BTF3


639 


81 


1337 


AF095927 


Rattus 
norvegicus 


protein phosphatase 2C


1931 


93 


1338 


G03877 


Homo sapiens 


Human secreted protein, SEQ ID NO: 7958. 


621 


100 


1339 


AL008582


Homo sapiens


bK223H9.2 (ortholog of A. thaliana F23F1.8)


626 


100 


1340 


X61613 


Homo sapiens 


leukemia inhibitory factor receptor


5820 


99 


1341 


Y01519 


Homo sapiens


A carcinogenesis-inhibiting protein.


7528 


97 


1342 


AF207600 


Homo sapiens 


ethanolamine kinase 


2372 


100 


1343 


U54807 


Rattus 
norvegicus


GTP-binding protein


1167 


97 


1344 


AC020579 


Arabidopsis 
thaliana


putative phosphoribosylformylglycinamidine
synthase; 25509-29950


3283 


51 


1345 


Y28576 


Homo sapiens 


Secreted peptide clone pe503_l. 


944 


100 


1346 


W74787 


Homo sapiens 


Human secreted protein encoded by gene 58
clone HHFHN61. 


1171


100 


1347 


M55542 


Homo sapiens 


guanylate binding protein isoform I


2636 


87 


1348 


AF183428


Homo sapiens 


28.4 kDa protein 


1329 


100 


1349 


U70669 


Homo sapiens


Fas-ligand associated factor 3


167 


24 


1350 


AF295530 


Homo sapiens


cardiac voltage gated potassium channel 
modulatory subunit 


562 


99 



TABLE 3



SEQ ID
NO: of
nucleotide
sequence


SEQ ID
NO: of
peptide
sequence


Method


SEQ
ID NO:
in
USSN
09/496,914


Predicted
beginning
nucleotide
location
corresponding
to first
amino acid
residue of
peptide
sequence


Predicted end
nucleotide
location
corresponding
to last amino
acid residue
of peptide
sequence


Amino acid sequence (A=Alanine, C=Cysteine,
D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine,
I=Isoleucine, K=Lysine, L=Leucine,
M=Methionine, N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine, W=Tryptophan,
Y=Tyrosine, X=Unknown, *=Stop codon,
/=possible nucleotide deletion, \=possible
nucleotide insertion)


1 


1351 


A 


2 


337 


1 


TPSLIHQAPTPCPAGLWG/PFNGHYHGS*PGC 
HWPQAPHRA***GLLPPRWLGHGLPGGPAAP 
WAASQWVDGVAGRLPGPAWSWHASGAAPA 
QPGPL*LLVPGSSQLPDPRDP 


2 


1352 


A 


27 


100 


366 


KNSSIRPMKERETFCLSAKHMrrCSASYDIRGL 
QIETT\YHHTPIRMAKIQKT/GHHQC**ECGAT 
GTLIHGWWGCKWEPLGKTVWQIPK 


3 


1353 


A 


40 


3 


314 


. HASAHASWUKDNSELEQQLOATGAYRARA 
UHLEAEVAEMRQMLQLEHPFVNQADKLRPD 
SMYVHLNEL*QSLVENMLLTVVDTH\RTPI«R 
SCNYTLALILFL 


4 


1354 


A 


74 


2 


292 


TASALFSCPDGGSLAGFAGRRASFHLEC3LKR 
QKDRGGDISQKTVLPLHLVHHQVAHTFGQAT 
VTCQQARQSPG'RTNPE/ALQWVLPVSDGWH 
VULP 


5 


1355 


A 


78 


114 


850 


ENCRVASNLPGVFFSEDTAQSGSYMRISAHPP 

NAGGEVSNGPKRKLHAILNFSLPSSGLNAGA 

FYALSTLLNRMVIWHYPGEEVNAGBIGLTIVI 

AGMLGAYISGIWLDRSKTYKETnLVVYIMDT 

GGAWWCYTFYLGTGDTCG*CFirAG\TMGFF 

MTGVLPLGFEFAVELNSYPESEGISSGLLNISA 

QVFGHFnSQGQIIDNYGTKPGNlFLCVFLTLG 

AALTAFKADLRRQKANKETLEN 


6 


1356 


A 


81 


97 


376 


EWFSYMLGSNMSVYHSP*SLEPLCKVLSES*A 

YLRVPFIRILLNAR*IRKAYXRMSLE1KLLI/RB 

*CLFQEMGLSLQWLYSARGDFFRATSRL


7 


1357 


A 


93 


2 


872 


TLSSACUGDAWKELXrVAGAVSNQLLVWYP 
ATALADNKPVAPDRWSGHVGnFSMSYLESK 
GLLATASEDRSVRIWKQGDLRVPGGRVQNIG 
HCFOHSARVWQVKLLENYUSAOEDCVCLV

WSHEGEILQAFRGHQGRGIRAIAAHERQAWV 

rrOGDDSGlRLWHLVGRGYRGLG/DLGSLLQ 

VP**ARYTQGCDSGWLLATAGSD*YRGPVSL 

*RRGQVLGAAARG*TFPVLLPAGGSSWSRGL 

RIVCYGQWGRSCQQCPHQHSNCCCGPDPVS 

WEGAQLELGPAWL 


8 


1358 


A 


106 


3 


350 


FSSLLSGRISTLRDETGAILlDGDPAACAPnKF 
LLTEELHLRGVSIYVLRHEAQIYGITPUVCAL 
LI/C3RRL*SDSCMRAALNDRGLYQVLILDGLV 
QCLGFVDSDSRKMVSTLT 


9 


1359 


A 


115 


49 


186 


QAWAIFKGKYKEGDTGGPAVWKTRLRCALN 
KSSEFNEGPERERMDV 


10 


1360 


A 


123 


2 


1249 


KGCRTQEKVDRTEVIRTC3NPVnrSKLFTVDFY 
FEEVQRLRFEVHDISSNHNGLKEADFLGGME 
CILGQrVSQRKLSKSIXKHGNTAOKSSrrVIA 
EELSGNDDYVELAFNARKIDDKDFFSKSDPF 
LErFRMNDDATQQLVHRTEWMNNLSPAWR 
SFKVSVNSLCSGDPDRRLKCIVWDWDSNGK 
HDFIGEFTSTFKEMRGAMEGKQVQWECINPK 
YKAKFCK>rifKNSGTVILNU3aHKMHSFLDYI 
MOGCQIQFTVAIDFTASNGDPRNSCSLHYIHP 
YQPNEYLKALVAVGEICQDYDSDKMFPAFGF 
GARIPPEYTDSHDFAINFNEDNPECAGIQGW 
. EAYQSCFVPKAPTFTGPTNICPHSSRKVAKFRR 
SEGN*HQGRAFAnFILVDPGQVGVYSQDMGP 
DNPGGHFV 


11 


1361 


A 


147 


614 


9 


ACARKQLLGRTVFIWFVGQLLGGELKGYSKT 

NTTSSRPASSRGVTLSSSSSSSSSLTKDALPSSL 

KSDSTTITSGLVFPFRSLCVNPAKSSVSESVSSI 

KDLLSSSVKYLE'KRTSCCTPDSSESKLSQLSS 

DSIVSMGTSSRKPTNSSSSLGALKMSATS\*G 

SGSESPTPFFLTGLQSPPSTOPREPGLTTARNS 

TTLTRDC 


12 


1362 


A 


177 


12 


416 


LIPSEPALDSLVDPRVRSRKQPFVIYPVYDTAI 

DTIOHFSLUJGNVGEPDMSAGFCPNHKAAM 

VLFLDRVYGIEVQDFLLHLLEGGFLPDLRAA 

ASLDT/AEIGAMDFLLS»LFILCLMMFEFIYFFI 

NIXTMNVY 


13 


1363 


A 


249 


535 


105 


WTFHRHLSPAPLIVCDQGTCVVSYYPQNIVQ 

MPDTQMEQGLN/HLFLDGNA*PHSVECYC3'S 

TFEIAKTTSFVLYFHRYRAPEVIXRSSVYSSPI 

DVWAVGSIMAELYMUIHJPGTSEVDEIFKIC 

QVLGTPKKVSTLVPKLL 


14 


1364


A 


254 


572 


201 


YU,TXIGNUvIMLLVINADSCLRTXM*EFLGH 
FFFLDICYSSVTAQDAAEFPVS*KPILVWGYIT 
*SFFFIFSWGTNGCLLSAITYACYAAICHPLLS
TMVMNRPLCTATVNATNKMGFLNSQVN 


15 


1365 


A 


257 


425 


68 


THAKFLNKKFNIPKl,VIUm,VYIVKAIPTKM 
AIEFLLECDQNrr\iaiCENT*KNIAKNI*KRRV 
TFTPIEr*HPVKQMIKWQ*LTAWLRNRGYKKJ 
KQTTOSETAPSVCRNLVFDKCG 


16 


1366 


A 


263 


104 


481 


FCIFRTTEEDRGGDDCVVSVWTKQRNNSCVK 
SKDVFSKPVNIFWALEESVLGVKARQPKPFFA 
AGNTFHvnXXVSSKNKSPRYSVLIMAEKPV 
GDLSSPNETKYUSUXJDSWKLENWTOASRV 


17 


1367 


A 


298 


68 


208 


RHlTNNPIKIJ3KKFEHFKNEDI*lTSKHrKMW 
VSSLAMKEMLTKTTM 


18 


1368 


A 


300 


904 


1 


LWGlTGTRHHARVIFIFLVETGFraVGQAaL 
ELLTSGDPPALASQSAGITGMSHCARPKGHFG

IHLK»MFyTMSQKMP*PTINLILLLlIPGNLNIF 

KPNMGWLGPKTAFV*KDEVLSGIPFAKGRCR 

WK»DY*C/LQEVTDPIMEKGKKKKRTASFFK 

GQPHQSTNALLRRCVR*RYHLS\TVETAGLP* 

KNTGHIPGQPFLFKLVFKC*NV1CI**QYKW*Q 

NIGVKNKSFCPH'SSSPSL»FIGHHSRNF/CSFK 

TEPHSWQAGGQWRNLSSLQAPPPGLMPLSR 

ISLMSSWDYRRPPQ 


19 


1369 


A 


302 


3 


445 


nspsrwak1qmfehtfcg*gcg/er/nvhihcs 

wicrlrpllwravreylsklknaelsfdpgv 

sluayaidmptsi*dekeallfaflafhe*hc 

ksriwavlq/cahlwdwlrkl*cfhrn4kfya 

av*nkprhllshiwkdvqnilij: 


20 


1370 


A 


304 


1 


1339 


FFFCGKEVPUFEQNKHPOPRATTSPQA/HARA 

LLSAGEFTAGVGLSP*AmSFVWLCmQHGA 

GGPCMQPGGSPGPWMHTTQAGHLWEGAYPG 

GSSTWHQVPGQLGGSWGPRERSLLGSFIKCSP 

CPHPPGFRLWMSPNQKPPTENPGVMGRVWR 

LMPGESPLIWEAEGKEDHLSPEGQGHSE/PVA 

MAGQOQSRPAAR»PPCPALTPASHSAGTWPP 

RICRTVPGGPCPSPSGFRSCRR*GFSA*TRSWP 

DAEPPSTPDTAPRCCTQSDTSSQGPQ*S*WRR 

CRALPGRLCSAPAAGLRRARPRLSESRRGNSP 

PASPAAASARCPSWGPSCPARPPSRPAAGTEP 

AAPSRCTAWLRGEREPGPRPPGRRPRSGRGF 

VSFAPEVLSLPAVRQTKSWRWKNEEEITRPW 

ALVRSRGG 


21 


1371 




326 


799 


1587 


GSOVLPPPPSODSATLPODA*fiPRAAPf5r)PVP 

E*GLQGAGVRRLRGEVLCQPQP»GAL*EQCU' 

HLSFSPRQGAAPDTEPSAWGPAFTGATGPGLP 

LRHVRLFSAQAPRGAATPCPPALLHQPAWPP 

ARPMFRGHPPVRPLGPWGKVAAGPRALCLA 

GVPAVQGECATKPSG*(3L*PAHLRGPPGPEVL 

QWHWQLSAGRDPVPAEDPPL*EGPLQPGGPA 

AAQAEPGADPEPEDKDQAAESRPAGAMSLSA 

QGSGPVGGQOLR 


22 


1372 


A 


327 


146 


652 


PHLENPHPEHSFPGAPLT^STLSWSILSPREPSP 
GAPCYPGHPHLE3<PHLEHLLTWRTVTWSTLL 
PGAPCYPEHPHLEHPLTWSTPHLEHPSPGEPL 
SCRTPTRSILHRDHPLP*CLSTEESPI»GWGSLP 
APPSTPLVLDVAPPGPQPASSCPGRDSCYSVP 
GTWSP 


23 


1373 


A 


348 


397 


2 


CIVSSCQGTRKPCHLEDANKINKQSPTLEKIES 

LQESL*VKQ«LIVAEKYVQILHreKKYFQRPL 

NNEIOUCMKKRKEEKKKCRERMQRRSKWRR 

EEKKE*RREE\EERKKEKEDRKERRK£TSPRG 

SRRLLRD 


24 


1374 


A 


362 


170 


352 


GRALDTAAGSPVQTAHGLPSDALAPLDDSMP 
WEGRTTAQWSLHRKRHLARTLLVSRVRGPQ 


25 


1375 


A 


384 


373 


128 


VLnnLETGYLWKNRHSDQ*KRTENPERDQH 
KYPKVDFCKSNSMKNRLCNKWHWTNWIFTD 
EOKINLNLKPHTKLTPNIKKN 


26 


1376 


A 


397 


383 


165 


EVKmWFIFSGTOLTIWIRSI*RKSDEINQRTK 

•MEKTSIsmRRXJmOCMSFLPNUYKFNTISl 

lOPANF 


27 


1377 


A 


406 


103 


380 


IKKATGYMVNI*KLIV\FLYANDEQLEIEMNK 
IVPNFNGSKNKIAFTNLTKYQNIQNRHAENYKl 
LVNKIEDLNKWRNVLLSWIGRRNIINTTkfr


28 


1378 


A 


408 


14 


427 


TICTOKFNNIJ5EIK/FLERHKLSKLTQEEVBNL 

rrLKTSRETELVlNK*VIPHKEKPGPDSFTGH' 

YQTFKEEL/n/ILHKLFQTIKYGRILPNSVYETSI 

TLKPKPEKDL\KENYRPU»LSNroAK\LNKrLA 

NRI**HIR 


29 


1379 


A 


434 


395 


128 


IYSKMCMERQRLNN»ILKKNKVRGIAVPDVK 
VYYKPTVD!/rSWIL*KDSHIVEWNRLENLBID 
PN/KRLILDKGAEATEWKKDSFFRQWQ 


30 


1380 


A 


455 


2 


228 


FFFETESHSVTQAGVQWCNPGFKRFSCFGLSS 
SWDYRyAPPRP\ANFV»FLVETGFyYVAQAGL 
KLLSPGDLPALAS 


31 


1381 


A 


462 


393 


2 


QLMFDKGVKNimWGWTPHTK»YWKNWISI 
CREMNLNPYLSRYIKINSRVKDLTVRPEPIKLV 
EENTGKTIQDT<HX}K*F1AKTSKAQSTKTNK* 
KRQTRYIKLX\KKSTASKENNRVKRQPLE«EK 
IFAN 


32 


1382 


A 


474 


125 


471 


VKPYEIAVFLVKPIEyK*HLLSDPAIPLSG]*LK 
EIKAYT/R1UCTPMFAAPVSV1A/KN*KQSK/CQ 
KQ*YVHRMEYYTTIKRSEIUCTTTWVDFKNT 
ILRETDRIHKTTYDVISU 


33 


1383 


A 


488 


1825 


2 


KSACSFICSEEQPASPSPLKPGTYASETVRPRDP 

HAAGPRRDSSEAETRRPRGA/DGSGTWKGT 

PGSPAPPCSWGHGG\ETEGAG*CPAAPGTDLR 

APGGSAGSnGLPSAGGSRGRKGWRAAGRQP 

STR*ORPGRHGGR0E*AGHPEPRQSALQSAG 

L/ASSPEPMGAALAEDGSGDSRGAGPRPQE*P 

PSVLSRS\GS*G*G*AASGTASSPRSHSSRLGPP 

SAGFHGLRCX3QPPFAAAPPGPWPGTGRPAGG 

AGSPPAAAGTAPPATRQAQSRRQNRTAGRNA 

SPQTAAGAGSPVQWALSRATG*TGETGSWC 

AGGTHQATHLTAAWVCEPTWSVRPGGSGPA 

AGLGR*GRHPAQSPPLPVPRO*PAWPQEAPSP 

SPASSEVALSSGSCWPDQAPGPARGSPPAPLA 

PAWPAAGRGRQR»GRQSAHPPPRR»STAVSL 

SGTS*WRRSP*AGTRTQQC»SPWLVPACSSRP 

L*RGTRRPSTQQSPQTrGTPGRSAGPGHPRS* 

GGRSPAGTGHLGAQTVASPH*GHWPTALSCL 

WASASPPGPEAPPQTGAaOTNCRYRAASAR 

RSSVAPACA*GWQ*AGSPPAVLRGPP*RVRER 

GALTHRPRAPDE 


34 


1384 


A 


497 


422 


2 


APGASVGRAQAAEG*RGGPTGRPPSALGVS/E 
AGRAGRAGEGRFVPPAYPLCKSAQTSGPPKA 
RLSVPPLASCOOROPPGOAACATCAPPAGPAR 
SSRCRRRSPPE«GPR*PSRPARPSPGSAASRRQ 
KLTPCRCQFRGLCA 


35 


1385


A 


509 


156


475 


PTPYPGE*QAAFLLRGPGLRPPA/DPSLR/HRN 
LTELWAVTDENIVOLFAALLAERRVLLTAS 
KLSTLTSCDHAFCALLYPMRWEHVLIPTUT>H 
LLDYOCTPLPRT 


36 


1386 


A 


512


3 


1631 


FFFSFVCHLYCVSPTPGPHGRLATWIVPGLLA 

FLGLAAGGQTLCPAGELPGHARAQASGAPGS 

VLIAVP0RRRVHTCGPGPAAPSTRGEC3WAL 

GHTRPARPRPV\PFAPAVPQEPGGQGHGAA/P 

PATGHSAPRGCPPARAAPTGSAIPAPPPAACA 

AFHSAWSVPPA(»QQG*RVPAPAFRRTTPGT 

PGQHLLDRPGAPPAQGSGPAPAPPPRLAGPA 

ffl>AAPPPGPPAASWHSSLSKSSSSL\GWSPPLP 

VGPGSLQ*TPPPQGPHLSGSCGGTSSWRGQR 

AAVARRLRSWNACOLSRVAGRSSASYPORE

GRPSOSO*PAOPPGMRCim llOW*P<55JSf!Sn 

GPGPHPASTWLRAGKTGPSPPACGCA*LPPPS 

VSAAPQSPRTRCPRGCAAAAGLCVLAAAOAS 

HaA\OLPaVRVHrQRVHIH*0AC3/0CQTPRPR 

LRSLPVLGLPAPRCTVSAHPWHRRSGSSCWA 

ARLVPRHPAPGCa>*»TG*\PUTGFPEP*A*GLP 

NHQAVGLEASGALQAOHRDELPTMVQLLDH 

SPDYPLKGRPHAP 


37 


1387 


A 


620 


828 


1 


FRUPLAAOA/RGAAEPRVAVSMAPDPSAKIH 

RLNSTHLYACGTHAFQPLCAArDAEAFrLPTS 

FEEGKEKCPYDPARGFTGUIDGGLYTATRYE 

FRSIPDIRRSRHPHSLRTEETPMHWLNG*EDE 

AQDDGG'GTISSFLLPWPADHPTPKSPGEPVH 

oir V 1,^1.^ V iS\j\^t\io\j\JrijciorJVLfl.>K i tl 

VDAEFVFSVLVRESKASAVGDDDKVYYFFTE 

RATEKESGSFTQSRSSHRVARGIPPL 


38 


1388


A 


739 


1 


427 


FRAMVSSTLKLGISILNGGNAEVQ/QGNRGKG 

TSEEGKEG*EVPV*LPVSPPLPRPLQKMLDYL 

KDKKEVGFFQSIQALMQraGEKVMADDEFT 

QDLFRn^QIlX^EOHNNDFQNYLRTQTGNnT 

INinCTVDYLLRLQESI 


39 


1389 


A 


767 


1 


1030 


TLDLTGPLLLGGVPNVPKDFRGRNRQFGGCM 
RNLSVDGKNVDMAGFIANNGTREGCAARRN 
FCDGRRRQNGGTCVNRWNMYLCECPLRFGG 
KNCEQGEWPASSIPPVTAAWEALLLDVPGTT 

vo/^f vifW/^fYOl \nrvA Avi'\fT\czmm f\Dn^n 
VKuL<rUl^VKv^rJL.VV I AAri VlJonKrl-.v21il vL 

RRAPAPASGVPSPSGVGWDR*AGPAEPSPSTP 

ATVnSVP\VYLGLMFRTR\KEDSVLMEATSGG 

PTSFRLQVTGAPCHQGTC*VGARGRDPMLSG 

I IJVTTM^PWMHT T TUT V~KI\n^'PT\CD\4W1 \JTkA 
UK. y I UXjJo WrltilAAJii^nM VlUjd^cMKxlL* V i I» 

TLDYGMDQVSWHLHLLWQ*TLPPAQGKTGA 
QAAPRL 


40 


1390 


A 


801 


69 


399 


MKinHKEDLNKWKYILCSGMERLSTVMIPVV 
PQIIYKTOA*QVVILlCFTW*E*GAKmLRKNKL 
RGLVLVELSTC*VKYLLDKVLPHIKTyYEAR 
VNKSWT VnVTTM 


41 


1391 


A 


833 


7 


195 


SMLKERKWQFPSOJTQYrrWLGPPYHVLFD 
SSVTNFSIGAK'DILQSVMNCLYAKRIPCVT 


42 


1392 


A 


841 


1 


415 


GSTHASGYDKTPDFILOVPVAVFfiHirHWIFS 

KASFGDECSHHAYLHDQFWSYWNSLKHRTW 

QGIGTVASNLSQL*TLNAPFPELLU'RSLARTG 

FVLTnRFGPGLYIYWYGHQEIJDCNRERGILL 

KACFPTTflVTL 


43 


1393 


A 


843 


358 


92 


PALSPAP VPOKKGSPLPLDPCLGPS55WLI SSVG 
LQWPRL*PRRGPGDPGSLPATPPLLTPPHnXP 
QRHVELPPSHAQLARPPPPEPISVP 


44 


1394 


A 


833 


452 


1 


LPOYCFFPRLSPKSKLVKHSAL»*PSALICPPTK 

SPROPRTSLYFnCC/PPALQL/SPIEDPPAIYRS 

PPTHMLRSASQn,NQAPTLA/KGHPPSRFLQG 

QVSCPPQPTLPREKPLPLHURPPPRPAQPPLPR 

PLTFSTRRNVDPEIPERFR 


45


1395 


A 


894 


379 


162 


GVYPPTVFDNYSVQTSVDGQIVSLNTWDTAG 
QEEYD/RLRTLS*PQTSIFV1CFSIGNLEFPIYGT 
WLSMSVfGK 


46 


1396


A 


900 


1 


366 


TTKKTLISNNVSSRSLPILPELKAFSLAFNDPL 

mQKYMRT/DQ*CVTHDISLYIVTKLAUFLIPR 

VF1J^QLNIT**CLHFFTMTTFIAIITSFIJFLGR 






WO 01/57188 PCT/US01/03800



DyKSLAMLPRLVSNSWPQVILPP 


47




A 
t\ 


ytH 






07 Cnn A<!ROn *<!OI I RBI URFIMHI NPOfiOfr 
CSEIAPVCTPAWVTQRDFFRKKK 


48 


1398 


A 


963 


216 


308 


HFTPDRIAIVKNraDSHCWRGC»EEGAPARC 


49 


1399 


A 


967 


466 


1 


PRKRESWWGERLP/PRGFPPAAEDAPAPGWK 
GRKHASRTARAHVFHPIRQSIRSPVRGRPGDP 
KAAnll<oAuilu^QoJ\AoKUO*uKvjrAr 1 K*ll 

GGPGSAPAPLPASSOCSUTDSSPWTTPPPAPG 
AAAAQP**TPRCPAALRAGAfflGRVGRPY 


50 


1400 


A 


973 


45


421 


EKaQAlX)VFVFCYIDHSSHCLMSCD»E/DQA 

LNFMPLEMEKCMSKLAFGCQRSSTSDDDSGC 

ALEEYAWVPPGLRPEQIQLYFACLPEEKVPy 

VNSPOEKHRKQLLYQLPPHDNEVRYCQSLSE 

E 


51 


1401 


A 


992 


2095 


194 


nURHEAARSCLGCAAGHVPAPGLRIXPTVRG 

PPGRRGPAAPGCVCY»SGESTFVSHVPQRMA 

WPGSAPPRGFHPLQSQTSPSDTVSSPQLSKEE 

DGPGWEHPLSSSL*SLGQAGGNH»QPEELAG 

WEPRGPPSLAPSSFT/TMWTALVLIWIFSLSLS 

ESHAASNDPRNFVPNKMWKOLVKRNASVET 

VraflCTSEDVTMAAASPVTLTKGTSAAHLNS 

MEVTTEDTSRTDVSEPATSGVAADGVTSIAPT 

AVASSTTAASnTAASSKTrVASSAPTTAASST 

WASIAPTTAASSMTAASSTTMTLALPAPrST 

STGRTPSTTATGHPSLSTALAQVPKSSALPRT 

ATLATLATRAQTVATTANTSSPMSTRPSPSKH 

MPSDTAASPVPPMRPQAQGPISQVSVDQPW 

NTTNKSTPMPSNTTPEPAFITTVVTTTKAQAR 

EPTASPVPVPHTSPIPEMEAMSPTTQPSPMPYT 

QRAAGPGTSQAPEQVETEATPGTDSTGPTPRS 

SGGTKMPATDSCQPSTQGQYMV/DHH*APHP 

GRGRQNSPSGOAVTRGDPFHHSLGFVCPAGL 

*iu^ljlil^l-.Hi^j01A>NQKD VUoJLKrl VKUAuA 

WREAVmJTUTUJOJlPNQVLPNSFGAIEEIC 

QMLKHt 


52 


1402 


A 


994 


1 


462 


ESGEFLVSFn.KKPTNVnfflmGMKFFNK/LIF 
*SIirDIAFYKIQHPFMLKAI,TKWA*EGT*PDR 
RYLH* SLRLNGEQLIOTPLRSGMR*G/CA]LPL 
VLNAMLSrVPAWPAGKTRHEKElTCPUGQE 


53 


1403 


A 


1011 


1 


630 


PEYIQQSAYDSKADIWSLGITAmAKGEPPNS 
DMHIWVLFLIPKNNPPTHCWHRtLESFKEV 

SYLTEUDRFKRWKAEGHSDDESDSEGSDSES 
TSRENNTHPEWSFTTVRKKPDPKKVQNGAEQ 
DLVQTLSCLSMIITPAFAELKQQDENNASRNQ 


54 


1404 


A 


1016 


1


222 


ISIDA*KAFDKIQH/CFMrnLKKLGIDGKYLN 

TIKA1DDRHTVSTILNVEKLKAFL*RSGTRQRF 

PISGSGARI 


55 


1405 


A 


1033 


3 


366 


HASVDGDEGSDDVYYYYIPAILREUJALNTA 
EAAEHRPEEDRMLSEDPWRPAHMIKGYMPL 
HNiraTEVIDVTGLNQSHLYQHLNKGTPNlKT 
QKRAAVLYTWHVLEQLEDLRQINQQSHGPG 


56 


1406 


A 


1044 


5 


429 


SVLTLQTRSPSKPLS\RKUMDWEWSRNSISE 

DRLETQSRASRSPPVTPNQSQETPVDGKPLAL 

PPNQSQKNIRYHIHYLHLQYYLDRHISATLPIP 

SSSGIPTPIAVrrDALTDLVEULOQPCSEESOR 

APQTLFLLAL 


57 


1407 


A 


1050 


11 


430 


GAYAFETNGFPIMLVLTTDKIEGDVGIAGLYD 

MH\ISIJMAFIXRTLVRCrrSYIIPVTHVLSTPV 

TCLRRREKDGVIVDVLSDTASNHNGFPVEEH 

ADDTHPAItt,Q<aTrLRSQPMGPLKHKAFEERA 

NLGLVQRRLRLED 


58


1408 


A 


1058 


258 


419 


IJKHRDTPVVGANNRALSCrPLTSLTLCALCPL 
PCLGCPTXATC3tLYQTTVAWF 


59 


1409 




1064 


3 


425 


KAFSFTTSLIGHORMHTGERPYKCKE(!XJKTF 

KGSSSLNNHQRnrrGEKPYKCNECGRAFSQC 

SSUQHHRIHTOEKPYECTQCGKAFTSISRLSR 

HHRIHTOEKPFHCNECOKVFSYHSALIIHQRIH 

TGEKPYACKDVGK 


60 


1410 




1065 


204 




LLLAVQQSCLADHLLTASWGGK/DPIPTKALG 
EGQEOLPLTV 


61 


1411 


A 


1079 


3 


383 


RHSRAHLCQPFHLVMRDLLQLGQDIFQGCHY 

LEEhniLIHRDIAAKNCLLSCAAPTRAATIGDF 

GMARYIYRTRYYQLGDRAL/LPRKWMPPEAL 

LEGIFTYNTDSWTFGVLLWEIFSLGYMPYPGR 

TN 


62 


1412 


A 


1080 


1 


859 


WEFLWSRRPSOSSDPRPRRPASKCQMMEER 

ANlJvHMMKLSIKVLLQSALSLGRSLDADHA 

jriA^^^rjT V vJvLcricj..«Axivii-r^v AJ^ioriijv/iNcwoFr 

GPLELVEKLCPEASDIATSVRNLPELKTAVGR 

GRAWLYLALMQKKLADYLKVLIDNKHLLSE 

FYEPEALMMEHEGMVIVGLLVGLNVLDANLS 

CLKGEDLDSQVGVIDFSLYLKDVQDLDGGKE 

HERITDVLDQKNYVEELNRHLSCTVGDLQTK 

IDGLEKTNSKLQERVSAATDRICSLQEEQQQL 

REONELIR 


63 


1413 


A 


1083 


2 


615 


SSFAKHKRIHTGEKPFICLECGKAFTSSTTLTK 

HRRIHTGBKPYTCaEECGKAFRQSAILYVHRRI 

HTGEKPYTiCGECGKTFRQSANLYAHKKIHTG 

EKPYTCGDCGKITRQSANLYAHKKIHTGVEKP 

YKCKECGKAFKSYYSnjOlKRIHrRGMSYEG 

DEaQRSLN/RSSILSNHKIIHNEEK/PLKCEKCE 

KAFNHTSICCRHKKN 


64 


1414 


A 


1084 


946 


1 


KKC»5LSSSLTDDSKNAQAPLALTESHLATLA 

SSSQSPEAKQLLDSGLPSIXVRSLASFCFSHIS 

SSESIAOSIDISODKIJlRHHVPOOCNKMPrrAD 

LVAPILRFLTEVGNSHIMKDWLGGSEVNPLW 

TALLFLLCHSGSTSGS\HNLG\AQQDQCKISFS 

FFSWLTTGLTTQQRTAIEVNATVAFFVLQC]\SC 

HPNNQKLMAQVLCELFQTSPQRGNLPTSGNI 

S\GFIR\RIiTX}LMLEDEKVTMFLQSPCPLYKG 

RINATSHYIQHFMHYGAGHKPRTLHLPVSTTL 

SDVLDRVSDTPSrrAKLISKQKDDKKKK 


65 


1415 


A 


1087 


103 


324 


PRAFEFVH'IEMIVG^IVQNIHLPTLQVLEDRA 
LFTMSVOSSLWSTYIJHVM/SIP/DRELLKPNA 
SVAJLHKLSNALV 


66


1416 


A 


1095 


3 


493 


HETCSVTHIVSFSLPFLNPSHPASTPGHTENEQ 

PSLVWFDRGKFYLTFEGSSRGPa^,TMGAQD 

TLPVAAAFTETVNAYFKGADPSKCIVKrrGE 

MVLSFPAQITRHFANNPSPAALTFRVINFSRLE 

HVU»NPQUX:CDNTQNDANTK\EFWVNMPNL 

MTHLK 


67 


1417 


A 


1098 


57 


356 


LKLTSLOFUGVSWGNLLISnXVKDKTLHRA 

PYYFLLDLCCSDILRSAICFPFVFNSVKNGST 

WTYGTLTCKVIAFLGVLSCFHTAFMLFCISVT 

RYL 


68 


1418 


A 


1106 


1 


1326 


MGKISATGINMGTKCSWALVWHLESYDPKH 

YEREGMQDWKTASOQSEEATQQSSQKPQPH 

YTTYQSSSFLKYSSESHLLAWRENSSEGSFQF 

PCBRSRARPPRTRQQRRGAAAGPGRGAVRLG 

HPQSAAQPQLRAAARIPESPAAFPAQPRPGSA 

RNSDASGPASLSRTLGRASSPRPPQAPDVTAP 

SPAALAPRAAROOSRAAALAOAEAEEPLRTL 

APRPTRAAAPPPPPPPPPLPPGAPPPPVRCVSR 

RARAPPWR/PAATGPPPVRPVAPSRKLGSARAP 

APALQIRKGTSSGLPGRGGGSGPGNNLSSVA 

GNWRGSSFAVHJPGMAKYQGEVQSLKLDDD 

SVIEGVSDQVLVAVWSFALIATLVYALFRNV 

HQNIHPENQELVRVLREQLQTEQDAPAATRQ 

QFYTDMYCPICLHQASFPVETNCGHLFCGSLT 

PNSIW 


69 


1419 


A 


1107 


2 


466 


FDTARLHEFGTSITQIFAVDNREDLQKWMEA 

FWQHFFDLSQWKHCCEELMKIEIMSPRKPPLF 

LTKEATSVYHDMSIDSPMKLESLTDnQKKlEE 

TNOQFLIGQREESLP/SS/CGPHSIMVnKWSS 

RKRY/SYPASEPLHDEKGKKRQAPLFPSDK 


70 


1420 


A 


1111 


698 


23 


ALRRLHYVRATKV\FLSFRRPFWREEHIEGGH 

SNTDRPSRMIFYPPPREGALLLASYTWSDAAA 

AFAGLSREEALRLALDDVAALHGPWRQLW 

DGTGWKRWAEDQHSQGGFWQPPALWQT 

EKDDWTVPYGRIYFAGEHTAYPHOWVETAV 

KSALRAAKINSRKGPASDTASPEGHASDMEG 

QGHVHGVASSPSHDLAKEEGSHPPVQGQLSL 

QNTTHTRTSH 


71 


1421 


A 


1119 


2 


383 


QKQTLQNGYLDSSMDILYLGSLPPELQVSSDE 
PPGPPEQAGLSQFHLEPETQNPETreEIQSSNLQ 
QEAAAQUQLPEWELSSTKAMEAPALPSQSL 
EGVHSSTEQKAPAQQLPAFEEILAPLUHHE 


72 


1422 


A 


1127 


1 


906 


HAQYVGPYRLEKTLGKGQTGLVKLQVHOT 

GQKVADCIVNREKLSESVLMICVEREIAILVRU 

BHWrnjOHOVYENKKYEPPDELTSOPSMLA 

QVSPHOKLSARRSWDLLSQFPRYLVLEHVSG 

GELFDYLVKKGRLTPKEARKFFRQIVSALDFC 

HSYSICHRDUa^NLLLDEKNNDUADFGMAS 

LQVGDSLUnRSCGSPHYACPEVIKGEKYDGR 

RADMWSCGVILFALLVGALPFDDDNLRQLLE 

KVKRGVFHMPHFIHDCQSLLRGMIEVEPEKR 

LSLEQIQKHPWYLGONnS 


73 


1423 


A 


1128 


1 


802 


URNAIX)VLHREVPRVLVNLVDFLNPTIMRQV 

FLGNPDKCPVQQA/MLEHXJSKTETLDLRAE 

MPrrCPTQNEPFLRTPRNSNYTYPIKPAIENWG 

SDFLCTBWKASNSVPTSVHQLRPADIKVYAA 

LGDSLTTAVGARPNNSSDLPTSWRGLSWSIG 

GIXn^THTTLPNILKKFNPYLLGFSTSTWEG 

TAGLNVAAEGARARDMPAQAWDLVERMKN 

SPDINl^naWKLVTLFIGGNDLCHYCENPEA 

HLATEYVQfflQQALDlLSE 


74 


1424 


A 


1139 


60 


480 


FREPCIiVPODHQPLREASWLA/LPPIGLWGT 

DSPLCCVEVAIPCNKGAHSVGLKGWLLAQG 

VUSMRinTPQEHPWESTPDLCFCRDPEEIEVE 

EQPAADAAVAKGEF/QGEQIAPVPAMIAAHPE 

AADPAPVHTTAHPKGA 


75 


1425


A 


1147 


2 


413 


PFPHQHPQBFVKOSCWPQSALROQCPOPVLGV 
TTTSDLCSLQVPVSSHRNPLLDLAAYDQEGR 

RFDNFSSLSIQWESTRPVLASIEPELPMQLVSQ 

DDESGQKKLHGLQAILVHEASGTTArrATAT 

GYQESHLSSAR 


76 


1426 


A 


1155 


38 


410 


PnSAPAQDDPILI^ITHClHAhlLLCVWRRDVK 

PDCKEIWIFWWGDEPNLVWQYIMNCMLWK 

KDSGKIkMFPMNVGRCVKFKHniNLLERCLMD 

KNFVLIGKWFVRPYYKDEKPVNKSBHLSCAF 

T 


77 


1427 


A 


1162 


526 


350 


RFPQGLEDVSTYFVUEElXSRGWSmELQGV 
LRGNLLRVFRQVEKVQEENKWQSPLED 


78 


1428 


A 


1171 


1 


1293 


MAESASPPSSSAAAPAAEPGVTTEQPGPRSPP 

SSPPGLEEPLDGADPHVPHPDLAPIAFFC3LRQT 

TSPRNWCKMVCNPWFECVSMLVILLNCVTL 

GMYQPCDDMDCLSDRCKILQVFDDFIFIFFA 

MEMVLKMVALGIFGKKCYLGDTWNRLDFFI 

VMAGMVEYSIJDLQNINl^AIRTVRVLRPLKA 

INRVPSMRn-VMJXDTLPMLGNVLLLCFFVF 

FIFGllGVQLWAGLLRNRCFLEENFnQGDVAL 

PPWYQPEEDDEMPFICSLSGDNGIMGCHEIPP 

LKEQGRECCLSKDDVYDFGAERQDLNASOL 

CVNWNRYYNVCRTGSANPHKGAINFDNIGY 

AWIYIFQVmJEGWVEaMYYVMDAHSFYNn 

YFILLDVSVREPGLLGGSFSTAQSPKCXJGDSFP 

GVAAESLLLRGWVLWLPGGG 


79 


1429 


A 


1175 


1 


405 


PNDFFKDMFTOLPGGPLGPDCABNDYGAYLN 

FLSATHLGGLFPPWPLVEERKLKPKASQQCPI 

CHKVlMGAGKLraHMRTHTGEKPYMCnCE 

VWTTlQDKLJaHNlRKHTGERPYLClHCNAKF 

VHNYDLKNHMR 


80 


1430 


A 


1182 


25 


198 


EMNELSQQLSQQGGRGASQCPSPPAPTLPNPT 
PLCXJLQLQRVNTGUPTPPCHPOAGAA 


81 


1431 


A 


1186 


234 


583 


KTVLDVGAGTGDLSIFCAQAGARRVYAVEAS 
AIWQQAREWRFNGLEDRVHVLPGPVETVEL 
PEQVDAIVSEWMGYGLLHESMLSSVLHARTK 
WKDGGPFLPXSSELFM 


82 


1432 


A 


1187 


2 


716 


DFVDAARNLPLESTKSPAEPSKSVPSLE\DPRA 

SSQGLPSQGPVQNQGRRGEQRPKKF/rVIQHT 

SSFEKSDSLEQPSGLEGEDKPLAQFPSPPPAPH 

GRSAHSLQPKLVRQPNlQVPEILVTEEroRPD 

TEPEPPPKEPEKTEEFQWPQGSQTLAQFPVEK 

LPPKKKRLGLAKMAQSSGESSFESSVPLFRSP 

SQESNVSLSGSSRSALFERDDHOKAEAPSPSF 

DMGPKPLGTHMLTV 


83 


1433 


A 


1188 


517 


804 


ESPGLSKVLRTGAFAYPFLFDNLPLFYRLGLC 
WGRGHGCGQEALSTSHGYHLFCALLTGFUA 
SHLPERLAPGRFDYIGHSHQLFMCAVLOTHF 
Q 


84 


1434 


A 


1192 


45 


476 


LGDVGFWVERTPVHEAAQRGESLQLQQLIES 
GACVNQVTVDSnPLHAASLQGQARCVQLLL 
AAGAQVDARNnXJSTPLCECLRtGQHRVCBA 
LAVLRGQGQPSPVHSVPPARGLHXREFRMC* 


85


1435


A 


1194 


69 


410 


KRSEEASAPPFPLGGTGAAPTRASLPEQILLPR 
Sa-EARKSQPDEKL1,SALHNSRTWN*EPRRSQ 
HRLVSPEVHPGRRGSSPGVAECKLTSAYFRT 
GRSPCPSLPGTTRTNSLL 


86 


1436 


A 


1215 


3 


405 


LPSHTCONPGRLFNGIQQOSTFNLGDKVRYSC 

NLOFFLEGHAVLTCHAGSKISATWDFPLPSC 

RADDACGGTLRG/AEWHHLQPPLPLG/ATKN 

NAIXnWllLAEUiUnALWIDFQLEDGYDFL 
EVTGTEGSSLW 


87 


1437 


A 


1216 


226 


964 


o I /u\r \jrvfi V Kjr IjAInivKAVjKL.i^oI-» V JLu V 14^ V 

VIWLAFNYWSISSRHVLLQEEVAELQGQVQ 

RTEVARGRLEKRNSDLFAWGHAQETDRPEG 

GRURPPQQPAAGQRGPREEMVEDDKVKLQNN 

ISYQMADIHHLKEQLAELRQEFLRQEDQLQD 

YRKNNTYLVKRLEYESFQCGQQMKELRAQH 

NNQWPKNIPKVAENVADKNEEPSSNHIPHG 


88 


1438 


A 


1218 


1 


534 


PEFGTnSCGYLMATDVSRRPSVHKAVEIEQE 
RVKSAGAWnHPYSDFRFYWDLIMLLLMVGN 
LmjPVGITFFKEENSP\PWlVFNVLSDTFFLLD 
LVLNFRTGIWEEGAEHXAPRAIRTRVLRTW 
FLVDLISSIFVDYIFLWELEPRLDAEVYKTAR 
ALRIVRFTKILSLLRL 


89




A 
A 




1 
1 




MGf'UEVl-MlNLRRRQDRRiiKMLRALQAQEIE 

CTOLVEAVDGKVGMLTRSNAAPGRHLAMLET 

LVVVAPM^ADNIJQLNPDTLSLLIAENKTVV 

APMLDSRAAYSNFWCGMTSQGYYKRTPAYI 

?IRKRDRRGCrAWMVHSTFLIDlJRKAASRNL 

\AFYPPHPDYTWSFDDnVFArSCKQ\AEVQMY 

VCNKEEYGFLPVPLRAHSTLQDEAESFMHVQ 

LEVMVPSSPSSAQSMAWSADHIGLVISYL 


90 


1440 


A 


1227 


2 


349 


NKTSFIFYLKNIWADUMJUFPFRIVHDAGF 
GPWDFKFILCRYTSVLFYANMDTSIWLGLIT/ 
YDRY/WKWRHL/WDSWMTGI/SFTRVYLLG 
LGARLVWFGKLILAKGGHGGISWL 


91 


1441 


A 


1245 


3 


1937 


LGSSDVRAPQRSELGAESPSRMVASQAYNLT 

SALTPILTRSRVLNEEPLTLAGF\SRAPANLSD 

WQLIFLVDSNPFPFGYISNYTVSTKVASMAF 

QTQAGAQIPIERLASERArrVKVPNNSDWAAR 

GHRSSANSVWQPQAFVGAWTLDSSNPAAV 

LHLQLNYTLLDGRYLSEEPEPYLAVYLHSEHl 

PNEHNCSASRRKPESLQGADHRPYTFFISPGT 

RDPVGSYRLNLSSHFRWSALEVSVQLYTSLC 

QYFSEEDWWRTEGLLPLEETSPRQAVCLTR 

HLTAFGTSLFVPPSHIRFVFPEPTADVNYIVML 

TCAVCLVTYMVMAAILHKLDQLDASRGRAIP 

FCGQRGRFKYEILVKTGWGRGSGTTAHVGIM 

LYGVDSRSGHRHLDGDRAFHRNSLDIFQIATP 

ilolAjoM WKiKV WilUNtvOLoJrAWrl#QHUVKU 

LQTARSTEFLVNDWLSVETEANGGLVEKEVL 
AASKASFRVPTPS\AALLRFRRLLVAELQRGF 
FDKHIWLSIWDRPPRSCFTRIQRATCCVLUCL 
FLGANAVWYGAVGDSAYSTOIVSRLNPLSV 
DTVAVGLVSSWVYPVYLAILFLFRMSRSKV 

GKOVHQRLLGKGQHT 


92 


1442 


A 


1246 


5


562 


VFDEENILNELNDPLREEIVNFNCRKLVATMP 

LFANADPNFVTAMLSKLRFEVFQPGDYnREG 

AVGKKMYFIQHGVAGVriKSSKEMKLTDGS 

YFCffilCLLTKGRRTASVRADTYCRLYSLSVD 

NFNEVLEEYPMMRRAFBTVAIDRLDRIGKKN 

SILLQKFQiOJLNTGVFNNQENEILKQIVKH 


93 


1443


A 


1249 


180 


901 


TVPPPPGGPSPAPLHPKRSFTSTGEAELKEERL 
PGRKASCSTAGSGSRGLPP\SSPMVSSAHNPN 
KAEireRRKDSTSTPNNIJ»PSMMTRRNTYVCr 
ERPGAERPSLLPNGKENSSGTTOVPPASPSSHS 

LAPPSOTRSRLARGSTIRSTFHGGQVRDRRAG 
UUvivnaVj V y IN urr Aor 1 LAHKAAFLr AuRPRP 
TTNLFTKLTSKLTRRVADEPERIGGPEVTRRP 
RQEDHLSPGGRGCSEL 


94 


1444 


A 


1261 


3 


385 


KFSQWGLTKPKLSNASP/WISLVKKLMKKWS 

VTQNLTFREQLEAGIRYFDLRVSSKPGDADQ 

EIYFIHGLFGIKVWDGLMEIDSFLTQHPQEIIFL 

UJ*NHFYAMDETHHKCj^VLR]QEAFGNKLCPA 

CR 


95 


1445 


A 


1282 


2 


550 


GPRDNPG\EDPRFEIVEHFOIAWFTFELVARFA 

VAPDFLKFFKNALNIJDU4SIVPFY1TLVVNL 

VVES^TTLA^tt.GRVAQVLRLMRIFRILKLARH 

STGLRSLGATLKYSYKEVGLLLLYLSVOISIFS 

WAYTIEKEEN\EGLATIPACWWWATVSMTr 

VQYODWPGTrAGKLTASACILA 


96 


1446 


A 


1294 


1 


1456 


QLLPPSNRENAGLLVGRCLCSAALRPVGDLIT 

SSGQVAVRNAPQAGSAKAGKGKFQDNFEFIQ 

YFKKFFnANC3>IEKDYNPVAAGQGQETEVAP 

STVAPVLNKPNQCPEGYICVKAGRNPNYGYr 

SFDTFSWAFLSLFRLMTQDYWENLYQLTLRA 

AETTYMIF/LV/LVILLGSLYLVTLILAV/VAMA 

YEEQNQATLEEAEQKEAEFQQMLEQLKKQQ 

EAAQQAATATASEHSREPSAAGRLSDSSSEAS 

KLSSKSAKERKNRRKKRKQKEQSGGEEKDED 

EFQKSESEDSIRRKGFRFSIEGNRLTYEKRYSS 

PHQSULSIRGSLFSPRRNSRTSLFSFRGRAKDV 

GSENDFADDEHSTFEDNESRRDSLFVPRRHGB 

RRNSNLSQTSRSSRMLAVFPANGKMHSTVDC 

NGWSLVGGPSVFTSPVGQLLPEVnDKPATD 

DNGTTTETEMRKRRSSSFHVSMDPLEDPSQR 

QRAMSIASILTNTVE 


97 


1447 


A 


1295 


2 


2057 


IQTQLPTKSSQQLRKGGNCVRCKMQMNFIAE 

EVXXKYRJTFyNNNKGPNMLYlEIKAFVHFMI 

NRYLSYGSGPKRFPLVDVIXJYALEFASSKPV 

CTSPVDDIDASSPPSGSIPSQTU>STrEQQGALS 

SELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLP 

MBPAPRHTTEEELSVLESCLHRWRTEIENDTR 

DLQESISRfflKTIELMYSDKSMIQVPYRLHAV 

LVHEGQANAGHYWAYIFDHRESRWMKYNDI 

AVTKSSWEELVRDSFGGYKNASAYCLMYIN 

DKAQFUQEVDUKTGQPLVGIETLPPDLRDFV 

EEDNQRFEKELEEWDAQLAQKALQEKLLAS 

QKLRBSETSVTTAQAAGDPKYLEQPSRSDFSK 

HLKEETIQlITKASHEHEDKSPETVLQSAIKIi 

YARLVKLAQEDTPPETDYRLHHVWYFIQNQ 

ArlUUlnKlXJLxQrGDKNl^FDERCHNIMKVA 

QAKLEMKPffiVNLEEYEEWHQDYRKFRETT 

MYLnCH.ENFQRESYIDSLLFUCAYQNNKELL 

SKGLYRGHDEELISHYRRECLLKLNEQAAELF 

JioUluJxuS V WW uL»UMN Jir t VrrLJ'lJLL VDE 

KDILAVEDMRNRWCSYLGQEMEPHLQEKLT 

DFLPKLLDCSMEIKSFHEPPKLPSYSTHELCER 

FARIMLSLSRTPADGR 


98 


1448 


A 


1304 


118 


453 


SGPSSRAIYLHRKEYSQNLTSEPTLLQHRVHI 
LMTCKQGSQRVQOPEDALQKLFEMDAHORV 
WSQDLILQVRDGWLQLLDIETKEELDSYRLD 
SIQAMNVALNTCSYNSDLS 


99 


1449 


A 


1306 


3 


1660 


CGYFCHITCAPQAPPCPVPPDLLRTALGVHPE 
TGTGTAYEGFLSVPRPSGVRRGWQRVFAALS 

DSRIiJJnDAPDUU-SPPSGALLQVLDLRDPQF 

SATPVLASDVIHAQSRDLPRIJRVTTSQLAVPP 

TTCTVLLLAESEGERERWLQVLGELQRLLLD 

ARPRPRPVYTIJCEAYDNGLPLLPHTLCAAILD 

QDRLALGTEEGLFVIHLRSNDIFQVGECRRVQ 

QLTLSPSAGLLWLOGRGPSVRLFALAELENI 

EWEVPKIPESRGCQVLAAGSILQARTPVLCVA 

VKRQVLCYQLGPGPGPWQRRIRELQAPATVQ 

SLGLLGDRLCVGAAGGFALYPLLNEAAPLAL 

OAGLVPEELPPSRGGLGEALGAVELSLSEFLL 

UTTAGIYVDOAGRKSRGHELLWPAAPMGW 

GYAAPYLTVFSENSIDVFDVRRAEWVQTVPL 

KKWRPLNPEGSUFLYGTEICVRLTYLRNQLAE 

KDEFDIPDLTDNSRRQLFRTKSKRRFFPRVSE 

EQQKQQRREMLKDPFVRSKLISPPTNFNHLV 

HVOPANORPOARDKSP 


100 


1450


A 


1318 


918 


190 


SLCVPGPVDTGTFAVMSVMVGSVTESLAPQA 

LNDSMINETARDAARVQVASTLSVLVGLFQV 

GLGLIHFGFWTYLSEPLVRGYTTAAAVQVF 

VSQUCYVPGLKLSSHSGPLSUYTVLEVCWKL 

PQSKVGTWTAAVAGWLVWKIXNDKLQQ 

QLPMPIPGELLTUOATGISYGMOLKHMEAOX 

PPVAPNTQLFSKLVGSAFnAWGFAIAISLGK 

IFALRHGYRVDSNQVWVMRDV 


101 


1451 


A 


1353 


220 


445 


DWPDLFTYPLIGSPKCFQSARPE\RMYRRTVR 
SSHGNHALQEVLPRSGHGTEFTKQKHUEAAD 
HGHPPARMSIFSR 


102 


1452


A 


1363 


542 


2 


AHLLMLNLAL\TDLL\YLTSLPFLIHYYASGEN 

WIFGDFMCKFIRFSFHFNLYSSILFLTCFSIFRY 

CVnHPMSCFSIHKTRCAVVACAWWnSLVA 

VIPMTFLITSTNRTNRSACLDLTSSDELNTnCW 

YNLILTAVIXaj-LVXYTLCYrniHTLTHGHAN 

\DSCLKQKARRLTILLL 


103 


1453


A 


1371 


2 


410 


CHSTESSSDFILPGDYIXGGLC3'LHSGCLQ\'\C 
SFNEHGYHLFQAMRiAVEEINNSTALLPNITL 
GYQLYDVCSDSANVYATLRVLSLPGQHHIEL 
QGDLLHYSPTVLAVIGPDSTNRAArrAALLSP 
FLVPMLLEQ 


104 


1454


A 


1376 


3 


432 


NSRVHDRS/NMSLWTQNITVCPVRNVTRIXjO 
FGFWSPWQPCEHLDGDNSGSCLCRARSCaJSP 
RPRCGGLDCLGPAIHIANCSRNGAWTPWSSW 
ALCSTSCGIGFQVRQRSCSNPAPRHGGRICVG 
KSREERFCNENTPCPVHF 


105


1455


A 


1379 


2 


396 


GLGLLYLIFAAVEGVMRVIGGSNHLAWLDD 

m^VIDSIFVWFIFISIAQriMKTIJUJUCNTVKF 

SLYRHFKNTUFAVLASIVFMGWTTKTFRIAK 

CQSDWMERWVDDAFWSFU^njVIMFLW 

RPSA 


106 


1456 


A 


1383 


1 


432 


EDGHGGWSSRCLVDHAEEGHREPWKRLCIW 
QRGOHEIRFAFYFPGHPLLSPQICLAPETPPRG 
CPPVSSLHFISLQ/RLPRDCQELFQVGERQSGL 
FEIQPQGSPPFLVNCKMTSGTFWTCRTDSRVF 
QNANPSNAAHSEDQPTP 


107 


1457


A 


1386 


719 


338 


FFFVTRSHSVAQAECSOVFTAHRSLDLVGSSN 
YPALSLQSSWDHRHTWUFAFL 


108 


1458 


A 


1397 


^31 


2 


RVAISXiCAAIFiaqvIVQSAGKRWPTGVMLM 
VWLFAFLYSWPIQALLPTYLKTDLAYNPHT 
VANVLSFSGFGAAVGCCV/GGFLGDWLGTRK 
AYVCSLIj^SQUJIPVFAIOQANVWYLQLLLF 

FQQMLGQGIAGILPKLIGGYFDTDQRAAGLG 
FTYNVGALGGALAPnGALIAQELDLGTALAS 
LSFSLTFWILRNRRPGKSLVR 


109 


1459 


A 


1402 




JO / 


WSHNSNSMCWGKDQCPYSGCKEAURTDGM 
RVTSRKSAKYRLQOnPRGDVSLTILNPSESDS 
GVYCCRIEVPGWFNDVKINVRLNLQRASTT 


110 


1460 




1421 


3 




LAELAFPVGVLATCA*SLLSC*YCVILFPCSCF 
FFHSPDALFSLLLLSCYFPSYCFFYYLFFSSSPL 
CLLLASSPFPLFBLLASL 


111 


1461 






■y 
it 




r la llVi 1 Kjr r«Nr.<jr A'AllAr GAQTS 1 1 AD 
QCALKPDLSYLNNSSSSSSTPATSAGGGIFOSS 
TSSSNPPVATFVFGQSSDPVSSYGFVNTAESST 


112 


1462 


A 


1434 


46 


372 


TTSWTTSCrRSCT*SGASSGPGWTPRTTWWR 
SRRSSQRTCSRACSGAWSRTW*RSS*TSSSSC 
STSCSSSSSRSCGRPGGPLGARGVHTTSCLNSC 
MSSSTTSSTTSTF 


113




A 

A 




3 




mil?lJVlTHYDRLVDE*AIJ>(AGKQRYEKMISG 
MYLGEIVRNIUDFTKKGFLLRGQISEMLKTR 
GIFLTFLLSNFUVCVLLFYVSFYLFQSCINFVL 


114 


1464 


A 


1463 


1 


396 


KQQAVPEPHSSTTTPQEQEQNWYGQDLLNLQ 
yK 1 It VHlJ'uHKTur AVAKinrErVKKErrVP 
ATSQGP*SPFSEEPPLM>SNEEVPPTLPP*EPQS 
EDP*KNA*LKQMHAATIHWQQHQQHQVGC 
QYHGIMQ 


115 




A 
/\ 








AOari^MVWaOHWUVTQKRKAL'VYSFEEG 
GRRKCGQYWPLEKDSRIRFGFLTVSNLGVEN 
MNHYKKSTLEILNPEVNPGFFFLTLWKQGEN 
NYCN 


116 


1466 


A 


1465 


667 


337 


LPPQRPA»TDSYSTCNVSSGFlAGQSHNnlLQ 
I W 1 Js. I y V WB WJLQiirJLUl NQLiDANCIPrQEF 
DINGEHLCSMSLQEFTRAAGTAGQLLYSNLQ 
HLKWNODSLFLCLSLPC 


117 


1467 


A 


1479 


1 


381 


GTSGGPKRVLVTERFPWQNPLPVNRGQAQR 
VLGPSNSFQRVPLQAQKLVSSHKPGQNQKHK 

NPEEELASDPNNEESL'RPWALEDFEIGRPLG 
KGK 


118 


1468


A 


1485 


3 


385 


TYLWL*GNPHFYEKNDGGLFEULRAKDEFbfS 
PYWDDMSDSAKHFIRPLTGRDP»KPFPCDQPL 

NLAFLATGWRHMRKLFMGANLEGPGPTVS 
H 


119 


1469 


A 


1486 


1 


398 


GTTSKHH*LARSURGPFDHDLKPNAATRDQL 

NHVSYPPTKQLTYEEQDLGWKFRYYLTNQE 

KALTKFLKWVNWDLPQEAKQALELLGKWK 

PMDVKDSLELLSSHYTNPTVRRYAVARLRQA 

DDEDLLMYL 


120 


1470 


A 


1497 


3 


999 


MGESPAV*GYFVLAGMNSAGLSFGGGAGKY 

LAEWMVHGYPSENVWELDLKRFGALQSSKT 

FLRHRVMEVMPLMYDLKVPHWDFQTGRQL 

RTSPLYDRLDAQGARWMEKHGFERPKYFVP 

PDKDLLALEQSKTFVKPDWFDIVESEVKCCK 

EAVCVnJMSSFTEFEirSTGDQALEVLQYLFS 

NDLDWVGHIVHTGMLNEGGGYEhaXSlAKL 

NKRSFFMISPTOQQYHCWAWIjaCHMPKDSN 

LLLEDVTWKYTALNUGPRAVDVLSELSYAP 

MTPDHTOSU'CKEMSVGYANGIRVMSNflHr 
GEPGFMLYIPIEYRWGFTMLSTLVSNS 


121 


1471 


A 


1498 


3 


306 


AQELLVGWDHIL*LIVL«TNLTELGRTTCDQN 
WPNSPDVLNHGCFYMQCLSKDCnGYVSRE 
MLVAHTHrVEEHTGTHLQYVSWPDHSVPDD 
SSDFVEFEN 


122 


1472 


A 


1533 


121 


329 


LGLFSFVWlliVLEEPKDFSCETEDFK'nJICT 

WDPOTDTALGWSKQPSQSYrLFES*VGSGYn 

DNFFLA 


123 


1473 


A 


1547 


111 


408 


DARrrWKPRNGSSGIWPGDGAK*PPAVEQAE 
RGHVEMIEKLTFLNLHTSEKDKGGNTAUnA 
AKHGHSPAVQVLLAQWQDINEMNEKQQTPL 
HVAADRG 


124 


1474 


A 


1555 


1 


745 


MIPDDDDKNTYGVALVWKKFQTQSLRLSDL 
HRKSHLWRG^VSm.IEGRDLKA^4DSNGLSDP 
YVKFRLGHQKYKSKIMPKTLNPQWREQFDF 
HLYEERGGVIDrrAWDKDAGKRDDnGRCQV 

DLSALSREQTHKLELQLEEGEGHLVLLVTLT
ASATVSISDLSVNSLEDQKEREEILKRYSPLIU
FHNLKDVGFLQVKVMAEGULAADVTGKSD
PFCVVANNDRLLTHTVYKNLNPEWNKVFTL

•VALVWKKFQTQSIJU-SDLHRKSHLWRGIVS 
rrUEGRDLKAMDSNGLSDPYVKFRLGHQKY 
KSKIMPKTLNPQWREQFDFHLYEERGGVrorr 
AWDKDAGKRDDnORCQVDLSALSREQTHK 
LELQLEEOEGHLVLLVTLTASATVSISDLSVN 

KVIRAEGIJVIAADVTGKSDPFCVVELNNDRLL 


125 


1475 


A 


1556 


57 


509 


GGPAPNSRYAEP*KNSLAMT»AHADCENYVA 

CGOLDNICSIYNLKTREONVRVSRELPOHTQY 

LSCCRFLDDSQIVTSSGDTTCALWDIETAQQT 

TTFTGHSGDVMSLSLSPDMRTFVSGACDASS 

KLWDIRDGMCRQSFTGHVSDINAVS 


126 


1476 


A 


1592 


3 


178 


EMUTa>LADQHNIKFHYAFALNR*ER 


127 


1477 


A 


1612 


1 


497 


TESIU-VRPYlJ'YJIXSiaJHL'UMTAGFSTIAGS 

FWPETEKPKrrLKNAMKMESGDSGNLL*AAT 
QGASSSISLVANIAVNLIAFLALLSFMNSALA 
WVGNMFDYPQLSFELICSYIFMPFSFMMGVE 
WPDSFM 


128 


1478 


A 


1619 


286 


486 


EDEVDFRASSISEEVAVGSIAATLKMKQGPM 
TQAINR 


129 


1479 


A 


1627 


1 


395 


PTRGAURYWIFGRFLCNIWAAVDVRCCTATI 
MGLCDSIDRYVOVSYPLRYPTIVTQRRGLMA 
LLCVWALSLVIYIGPI 7 ilWRHPAPFnPTrpni 

NEEPGYVLFSTPGSFYLPLAIMLVMN'RVYRV 
ARTE 


130 


1480 


A 


1638 


2 


466 


DPRVRTKIYNRKTnYEIQDKTGSMAWGKG 

ECHNIPCEKGDKLRLFCFRLRKRENMSKLMS 

EMHSFIQIQKNTNQRSHDSRSMALPQEQSQHP 

ia>SEASTTLPESHLKTPQMPPriPSSSSFTKVT 

KDKDIK*LLFNLYSSVEILPEVLHLKT 


131 


1481 


A 


1651 


607 


3 


LAEGGDVFDCVLNGGPLPESRAKALFRQMVE 
AIRYCHGCQVAHRDLKCENAIXQOFNLKLTD 
FGFAKVLPKSHRELSQTFCGSTAYAAPEYLQ 
GIPHDSKKGDVWSMGWLYVMLCASLPFDD 

TDffKMLWQQQKGVSFnHLSISADCXJDLLli 

RLLEPDMILRPSIEEVSAraPWLAST**KQWQV 

LSNKVGGESKPKKKK 


132 


1482


A 


1636 


150 


48 


EANLTSPSPKPlPSSDM*VFLr!r»TYFGAWHV 
VDAQ 


133 


1483 


A 


1660 


3 


406 


RKHIKLLIQKLSDVP*ECQNNQL»KLTEICEKE 
KKEFKKKMDDQRPEK]TEA*SKDKSPMEEEK 
TEMIRSYIQEVGRYIKRLEEAQSKRLEKLREK 
HKEmQPnXlEKPKGEGSSSFLSETCHEDfrSWF 
PNFIP 


134 


1484 


A 


1666 


127^ 


466 


PGSTHASAKl'nY*L*lILSNATEVDNNFSKPPP 

PPPPGAPPPSLIFTIESGHSSGYDSRSARAFPYG 
NVAFPHLPGSAPSWPSLVDTSKQWDYYARSS 
SSSSSSSSSSSSSPRDRDRER*RTREREREKDHS 
PIPSVFNSDEERYRYREYAERGYERHRASRE 

ITPI? 1> TJD TTD 1> lTDTTV'CIT'n? O CD CTKTC'D T>T> TTCcm 
Ffcj^K K ri K. K K i-i K f*.K r. K \ Krll\ooKoI>l oKlCKllboJb 

EGDSHRRHKHKKSKRSKEGKEAGSEPAPEQE 
SIEATPAE 


135 


1485


A 


1673 


1 


417 


PTRPVNSSQAEALVYYTLGALGGNLIAHMOL 
GYRYWAGIGVLQSCESALTHYRLVANHVAS 
DISLTGGSWQRIRLPDEVENPGMNSGMLQE 

ULiiy^J I Ijr L«/\CiV.ViU V IjA^j V vil-.Litjl_.H 1 .HI tt yR 

OV»QNHQRAFDYFNLAA 


136 


1486 


A 


1678 


525 


9 


ANTSLSSAAVSAVSPPPCXTSTATTLPPPMPSF 

FCVFPSPSMSPSPSBFLSCIASVSRVHSLSSSSS 

GSSSTASSLNFSAIMGSSSATASWVLSTASTPP 

CPSALPSSPAQES*SLAASSSAWPVAGISPSGA 

CTFPAGSASGAAKAPSPSWRCPSFRALFSLLD 

SSSLSL 


137 


1487 


A 


1680 


1 


2999 


AHRDEIQRKFDALRNSCTVirDLEEQLNQLTE 

DNAELNNQNFYLSKQLDEASGANDEIVQLRS 

EVDHUUlErrEREMQLTSQKQTMEALKTTCT 

MLEEQVMDLEALNDELLEKERQWEAWRSVL 

GDHC9QFEC31VRELQRMLDTEKQSRARADQ 

RTTESRQWELAVKEHKAEILALQQALKEQK 

IXAESLSDKXM)LEKKHAN4LEMNARSLQQK 

LETERELKQMXEEQAKLQQQMDLQKNHIFR 

LTQGLQEALDRADLLKTERSDLEYQLENIQV 

LYSHEKVKMEGTISQQTKLIDFLQAKMDQPA 

KKKKVPLQYNELKLALEKEKARCAELEEALQ 

KTRIELRSAREEAAHRKATDHPHPSTPATARQ 

QIAMSAIVRSPEHQPSAMSLLAPPSSRRKESST 

PEEFSIUU-KERMHHNIPHRFNVGLNMRATKC 

AVCLDTVHFGRQASKCLECQVMCHWCCSTC 

LPATCGLPAEYVTHFTEAFCRDKMNSPGLQr 

KEPSSSLHLEGWMKVPKNNKRGQQGWDRK 

YTVLEGSKVLTYDNEAREAGQRPVEEFELCLP 

DGDVSIHGAVGASELANTAKADVPYILKMES 

HPHTTCWPGRT1,YLIJ\PSFPDKQRWVTALES 

WAGGRVSREKAEADAKLLGNSLLKLEGDD 

RLDMNCTLPFSDQWLVGTEEGLYALNVLK 

NSLTHVPGIGAVFQIYIIKDLEKLLMIAGEERA 

LCLVDVKKVKQSLAQSHLPAQPDISFNIFEAV 

KGCHLFGAGKIENGLaCAAMPSKWlLRYN 

ENLSKYORKEIETSEPCSCIHFTNYSILIGTNK 

FYEIDMKQYTLEEFLDKNDHSLAPAVFAASS 

NSFPVSXVQVNSAGQREEYLLOTIEFGVFVDS 

YGRRSRTDDLKWSRLPLAFAYREPYLFVTHF 
NSLEVffilQARSSAGTPARAYLDnWRYLGPA 
ISSGAIYLASSYQDKLRVICCKGNLVKESGTE 
HHRGPSTSRR*PASPLPQYQQQRAFLQGRRK 


138 


1488 


A 


1686 


2 


526 


GRPQOPAPGAGSPPESGPGLWAALGCSLVWV 

PLCCLGGAAGRL*ARSGKSGLRIIRRAHAGPP 

PGGPCNSCP*CSAPESGGRGPLPGPGTGGVCS 

CWTRQCXTITARTAAAAAAPGPAORRPPOQA 

PQNGSCAASASQEAAAPPmCPPGRRWAVAS 

PPETRCPAAPGTRCRRLEAA 


139 


1489 


A 


1693 


3 


376 


LPSMSNCTSa-RLQSRTES«IRQAGHLLGRNE 
FIETKALGCAWFSLCYYLVLYFESSHKVDFVF 
IV*CFSTPPGAQMTIMSQACAERCNIMRLVDR 
RWAGIAKOVGTQKnORVHLOEQKALGL 


140 


1490 


A 


1704 


3 


376 


ERTNKITKEIJMIX3KNL1AATKSLSVAQRKFA 
HSLRDFKFEFIGDAVTDDERCIDASLREFSNFL 
KNlEEQREIMVS'EGCKUSQLSRGKKIWnVK 
LVLVEWKHLSLGTWHCNGKMRFPEP 


141 


1491 


A 


1743 


1 


362 


LITNKVFVARELSCLDVHUJSTGSTAVVADQ 
DKLELELVLKGSYEDTQTSFLGTASAFRFHY 
MAAL*TELSGRLRSSKSNGWNGDNSTGYLTV 
PLRPLTTVKEVTMDVPAPNVRGLNWMG 


142 


1492 


A 


1769 


1 


406 


NIJPSTLPRGS'PMSPRITMGRRRQRRREHKSS 

LSLASSTVGPGGQIVHTEnEWLCODPLSOF 

GLQLQGGIFATETLSSPPLVCFIEPDSPAERCG 

LLQVGDRVLSINGIATEDGTMEEANQLLRDA 

ALAHKW 


143 


1493 


A 


1789 


1 


447 


QMLRNGGDQNTVPDYHFADRIRELL»PTEDQ 
KNCIP*DTYLRPSALGNIVEEVTHPCSPGPCPA 
NELCEVNRKGCTSGDPCLPYFCVQGCKLGQA 
SDFIARQGTUQVPSSAGEVECYKICSCGQSGL 
LENCMEMHCMDLPTDTSALVR 


144 


1494 


A 


1814 


1 


404 


PGRRERPRLSQAGTDSGS*VFPDSFPSAPAEPL 
PYFLQEPQDAYIVKNKPVELRCRAFPATQIYF 
KCNGEWVSQNDHVTQEOLDEATQLRVREVH 
EEVSRQQVEELFQLEDYWCQCVAWSSAGTTK 
SRRAYVRI 


145 


1495 


A 


1827 


26 


448 


XVEEiaiADTWRSXCLSDFFFHAAKXLCXE»N 
CGDAISLSVGDHFGKGNGLTWAEKFQCEGSE 
THLALCPIVQHPEDTCIHSREVGWCSRYTDV 
RLVNGKSQCX>QQVEINVLGHWOSLCDTHWD 
PmJARVLCRQLNCGTAL 


146 


1496 


A 


1828 


574 


333 


QHEGGDLRRRQLGEIQLTVRYVCLRAASAC* 
SMAAET*HHVPASGADPYVRVYLU»ERKWA 
CRKKTSVKRKTLEPLEDET 


147 


1497 


A 


1855 


1 


372 


ERLVLTSEHCLVLTLFWPSWTYHTLLLSRQH 

VRIUJ>ia-m\EHDHLASIMNKLLTNYDNLFE 

TSVTYSMG»HGAPTGSEAGANWNH**LHAH 

YYPPLLRSDTVRKFMVGSQKOAQAQRDLTPE 

Q 


148 


1498 


A 


1879 


568 


7 


LLSALDDKGOTQPSASFSNAPTIVCVTACPAG 

lAHryMAAEYLEKAGRKLGVNVYVEKQQAN 

GIEGRLTADQLNSATACIFAAEVAIKESERFN 

GIPALSVPVAEPIRHAEAUtlQQALTLKRSDET 

RTVQQDTQPVKSVKTELKQALLSGISFAVPU 

VAGGTQVA*AV*RQGISSLHDVQVRTWNS 


149 


1499 


A 


1880 


611 


24 


GLNSENALSNBAMERGWQCLRLFAERLQDIP 
PSQIRWATATLRLAVNAGDFIAKAQEILGCP 
VQVISGEEEARUYQGVAHTTGGADQRLWD 
IGGASTELVrGTGAQTT*LFSLSMGCVTWLER 
YFADRNLGQENFDAAQKAAREVLRPVADEL 
RYHSWKEVRGASVTVQALQEIMMAQGMDE 
RTTMETWPVD 


150


1500


A 


1894 


2 


750 


GRVDFFHTOYRPLIRDSNNYVLDEQTQQAPH 
LMFPPFLVDVDGNPHPTKYQRLVPGRENSAD 
EHLIPQLGYVATSDGEVIEQnSLQTNDNDERS 
PESSILDGMIRQLQQQQDQRMGADQDTIPRG 
LSNGEETPKRGFRRLSLDIQSPPNIGLRRSQQV 
EGVRQMHQNAPRSQIATERDLQAWKRRVW 
PEVPLGIFRKLEDFRLEKGEEERNLYnGRKRK 
TLQLSHKSDSVGLVSQSRPRTCRRKYP 


151 


1501 


A 


1900 


141 


785 


GKTIQIQTTMQNKYKTVQKQYKnPKNKKA 

MEMQIKKQFQDTCKVQTKQYKALKNHQLEV 

TPKNEHKTOJCTLKDEQTRIXAILAEQYEQSI 

NEMMASQAURLDEAQEAECQALRLQLQQEM 

ELLNAYQSKIKMQTEAQHERELQKLEQRVSL 

RRAHLEQKIEEELAALQKERSERIKNLLBRQE 

REIETFDMESLRMGFGNLVTUDFPKEDYR 


152


1502 


A 


1915 


2 


377 


LVRLLDTQRDGLQNYEALLGLTNLSGRSDKL 

RQKIFKERALPDIENYNIFENHDQLRQAATEC 

MCNMVLHKEVQERFLADGNDRLiaWIlCG 

EDDDKVQNAAAGAU^MLTAAHKKUXKMT 

QVTT 


153 


1503 


A 


1921 


1 


237 


AYQSLRLEYLQIPPVSRAYrTACVLTSAAVQL 
EUTPFQLYFffEUFKHFQIWRLITNFLFFVPFG 
FNFLLYMIFLYT 


154 


1504


A 


1928 


2 


354 


EMVEGGEGKMCINlKWGGFGDNGCIDDIRTR 
YDTEVDEGSLNPGKQRYEKMTSGMYLGEIV 
RQILIDLTKQGLLFRGQISERLRTRGIFETKFLS 
QlEbDRI^ALLQVRRILQQLGLD 


155 


1505 


A 


1929 


2 


369 


TEIAKKMEAKKKYBKELTMFQNDFBKACQA 
KSEALVLREKSTLERIHKHQEIEnaaYAQRQ 
LIXKDMDLLRuREAELKQRVEAFESYQLELK 
DDYURTYRLBEDDRINIQISGHWQBSP 


156 


1506 


A 


1935 


1 


270 


VTRKLPIFIVDAFTARAFROSPAADCLLENEL 
DEDMHQKIAREMNLSETAFIRKLHFroNFAQ 
RSCFGUWFTPnDLQILTSSILPSIL 


157


1507


A






305 


l»KVNNIiKFRTKSrKrAEaI^SATK.QLDQPTA 
AYEYYDAGNHWCKDCJJTICGTMFDFFTHMH 
NKKHTQGQFQKSSDFQKEELQQTFLPPERQG 


158 


1508 


A 


1939 


1 


423 


TTHRLNVTAEPPCTSMPIYWAIPDVPHRCTTA 
NTCPVDLTDYCAQNGFYCLVYCTLFYGSLED 
KLHCQl ACrrLo Wi^KLJJlJLL.u 1 AKAIQFLH 

QDSPSLIHGDIKSSNVLLEffiRLTPKLGDFGLA 
RFSRFAGSSPIQSSM 


159 


1509 


A 


1974 


3 


401 


HTSTARLLLHRGAGKEAVrSDOYTALHLAAR 

NGHLATVKLLVEEKADVLARGPLNQTALHL 

AAAHuHoliVVnlu^VaAiJVUJU'UcQGl^ALH 

LAAQGRHAQTVETLLRHGAHINLQSLKFQGG 

HGPAATLLR 


160 


1510 


A 


1982 


2 


417 


KFI Km FKOYNTf PFPHI SPIfiSrPI iTNfnPfiPA 
lYSBYCNNHPGACXELANLMKQGKYRHFFEA 
CRU.QQMIDIAIDGFLLTPVQKICXYPLQLAEL 
LKYTTQEHODYSNIKAAYEAMKNVACLINER 
KRKLESIDKIA 


161 


1511 


A 


1984 


4 


770 


RETGSVSLSPSGLEGAESYAVSPILYSSPDVKE 
LWLETLQGQRHSHTGVKSTPGQSAAILMKLR 
SSHNASBCTLNANNMBTUECQSEGDIKHIPLL 
ASCESEDSICQLIEVKKRKKVLSWPFLMRRLS 
PASDFSGALETDLKASLFDQPLSnCGDSDTLP 
RPIQDXLTILCLKGPSTEGIFRRAANEKARKEL 
KEELNSGDAVDLERLPVHLLAWFKDFLRSIP 
RKLLSSDLFEEWMGALEMQDEEDRIEAUC 


162 


1512


A 


1986 


864 


501 


liNSGLFSAPDGSNLEMRLTRGGNMCSGRIEI 
KFQGRWGTVCDDNFNIDHASVICRQLECGSA 
VSFSGSSNFGEGSGPIWFDDLICNGNESALWN 
CKHQGWOKHNCDHAEDAGVICSSKD 


163 


1513


A 


2001 


419 


187 


AVDLSIDESSLTGETTPCSKVTAPQPAATNGD 
LASRSNIAFMGTLVRCGKAKGWIGTGENSE 
FGDIINLSTFWHS 


164 


1514


A 


2012 


284 


597 


SLLOUFPGTSTWCXPIVIETQLYVIVAQLFQO 
SHTVnKRDSFANKFIKIQAIElLKIRKPNDlETFia 
ENNWYrWADSSKAGFmYKWERETGFYSH 
QSFm 


165 


1515 


A 


2013 


2 


403 


EDPEELGHFYDYPMALFSTFELPLTIIDGPANY 
NVDLPFMYSITYAAFAIIATLLNfljnXIAMMQ 
DTHWRVAHERDELWRAQIVATTVMLERKLP 
RCXWPRSGICGREYGLGDRWILRVmjRQDLN 
RQRIQRYA 


166 


1516 


A 


2019 


2 


927 


COQREGLGLKAWQILLSHGRNGLPGEPASS 

QGLSAASSTPVFHLALQIDSAPDNIDWVEMLF 

NKNMVTERLQNVMVLEQCFSDSSSLYRFLTY 

SYlXAlWWLLLAPVTLCYDWQVGSIPLVEn 

WDMRNLATIFLAWMALLSLHCLAAFKRLE 

HKEVLVGLLFLVFPFIPASNLFFRVGFWAER 

VLYMPSMGYCELFVHGLSKLCTWLNRCGATT 

LrVSTVLLLLlJSWKTVKQNEIWLSRESLFRS 

GVQTLPHNAKVHYNYANFLKDQGRNKEAIY 

HYRTALNNNKAWDYLCWRFRKTLTDLP 


167 


1517 


A 


2025 


696 


71 


AAASAASSLTVTLGRLASACSHSILRPSGPGA 

ASLWSASRRFNSQSTSYLPGYVPKTSLSSPPW 

PEWLPDPVEETRHHAEWKKVNEMIVTGQY 

GRLFAWHFASRQWKVTSEDLILIGNELDLA 

CGERIRUEKVLLVOADNFTLLGKPLLGKDLV 

RVEATVIHCTESWPRnMIUTUaUCNFKKKRIV 

TTPQTVLRINSIEIAPCLL 


168 


1518 


A 


2046 


2 


366 


HLQVAARVFMPLQAVDSAPKPLKGQAQAPQ 
RLQGAARVFMPLQAQVKAKASKPLQMQKA 
PPRLRRAARVLMPLQAQVRAPRLLQVQSQVS 
KKQQAQTQTSEPQDLDQVPEEFQGQDQVLR 


169 


1519 


A 


2049 


1 


945 


QNLEDRJEVLNGVQTELLTSPRTKDTLSDMTR 

TVmSGEGGPLGIHWPFFSSLSGRILGLFIRGI 

EDNSRSKREGLFHENECIVKINNVDLVDKrFA 

QAQDVFRQAMKSPSVLLHVLPPQNREQYEKS 

VIOSLNIFaNNDGVLKTKVPPPVHGKSGLKTA 

NLTGTDSPETDASASLQQNKSPRVPRLGGKPS 

SPSLSPLMCTGSNKNAKKIKIDLKKGPEGLGF 

TWTRDSSIHGPGPIFVKNILPKGAAIKDGRLQ 

SGDRILEVNGRDVTGRTQEELVAMLRSTKQG 

ETASLVIARQEGHELPRELVMFRSQSH 


170 


1520 


A 


2050 


363 


1 


PVATHLTKILNSDEHAWISSAKTLCETVKDF 
VAKVEKTYDKTLENAWADAVASKCSVLNE 
KLEQIXQALHTDSQAAPVLPGLSPUVEEDAV 
ESSSEESLGESKEQLGDDVTKPSSQKA 


171 


1521 


A 


2055 


139 


675 


IPSRPWLGRrrOLDPAGPLFNGKPHQDRLDPS 
DAQFVDVIHSDTDALOYKEPLGNIDFyPNOG 
LDQPGCPKTILGGFQYFKCDHQRSVYLYLSSL 
RESCTITAYPCDSYQDYKNGKCVSCGTSQKB 
SOTiGYYADNWKDHLRGKDPPMTKAETDT 
AEESPFCMYHYFVDirrWNKNVR 


172 


1522 


A 


2056 


3 


36r 


LIQHKSAVEYAQSHLSLVSMCKESHKCSEPK 
MFWicvirnisrKiTRYTriiniPVTjrjiiTT vtth at 

.KEERSGLTTDDDTMSEMKMGRYWSKEERKQ 
HLVRGKEQRRRREFMMRIRLKCLKES 


173 


1523 


A 


2060 


1 


387 


GTMLSMQIPFVGFQPIRTSEHMAAAGVFALL 
OAYAFI OYT RDRT TKOFFOTI FFI/iVSl A Art 

AVFLSVIYLTYTGYIAPWSGRFYSLWDTGYA 
KIHIPIIASVSEHQPTTWVSFFFDLHILGCTFPA 
G 


174 


1524 


A


2071 


It 




KRUCE^EAKLKYEKEEMERLEIQIUEKEICW 

HRLEAKDLERBNEELEELYLLERCFPEAEKLK 

QETKLLSQWKHYIQCDGSPDPSVAQEMNT 


175 


1525 


A 


2083 


139 


486 


AALTWSQPQEFWPMEMQPIVTDMVTVHWV 
AESSTVGWLCALFRVTHVGVGATGHGWCG 
RRVLCGLPLPSPAPMPIMSLPEGESRKEREVQ 


176 


1526 


A 


2092 


3 


587 


EGSVNFKFGVLFAKDGQLTDDEMFSNEIGSEP 
FQKFLNLIXjimTLKGWTGYRGGLDnrKNDTT 
GIHSVYTVYQGHEIMFHVSTMLPYSKENKQQ 
VERKRHIGNDIVTIVFQEGEESSPAFKPSMmS 

T4T7T*UTT7AT \/l> VWOrtXTrVKIVDl tm7d7T}CVT>T C/^ 

xiT 1 nlTALi Viv I INl^l^ni^n 1 KljKlr dtlco Vr JLrVj 
PPUTTPVFTDHQEFRDFLLVKLINGEKATLET 

pa 


177 


1527 


A 


2103 


44 


427 


GKGQVSLEGRPHRGPLCLGSWWPGSRVPGC 
CDGAWLAWACWVFGNDFPSPASAACSALLG 
r*^v^TArT r'VPi p^joopt apfpdtaat nnm i> 

RAVSVPLTLAETVASLWPALQELARCGNLAC 
RSDLQ 


178 


1528 


A 


2104 


2 


409 


ALQSTXGAVWLGLLLNSLWKVAESKDQVFQ 

PRTA A^yCTSlfiAWPTPfTsniQV^MAVMPlTU/Vl 1JI7 

PGCAPRLLVKGSJa>SQQGRYNMTYERFSSSL 
LILQVREADAAVYYCAVEVPNTDKLIFGTGT 
RLQVFPNIQNPD 


179 


1529 


A 


2111 


1 


312 


MRTESHTGIJKKGGNANLVFMUCRNTEPKKG 
SYHFDLERLRAAHILFEREQEHLAPGGISMPL 
PPPLPLPACLO 


180 


1530 


A 


2116


3 


366 


TSKRAIETTDVTRSFGWDSSEAWQQHDVQE 
LCRVMFDALEOICWKCJTROAni INFI YOfSlf T 
KDYVRSLECGYEGWRIDTYLDIPLVIRPyGSS 
QAFASWCTFHLTACVSLHRIHNSTVV 


181 


1531 


A 


2117 


2 


386


YGLGAHFGRLFIQAGINENDFTOGAWCAGR 
NDLQQWXEVDAiOlLTRFrGVITCJGRNSLWLS 
DWVTSYKVMVSNDSmWVTGKNGSGDMIFE 
GNSEKEIPVLNELPVPMVARYIHTNPOSWFnN 

GSia 


182 


1532 


A 


2123 


1 


493 


RTKTOVYILNLAVADLLUJTXPFWAVNAVH 

GWVLGKIMCKITSALYTLNFVSGMQFLACISI 

DRYVAVTKVPSQSGVGKPCWnCFCVWMAAl 

LLSn>QLVFYTVNDNARaPIFHiyLGTSMKAL 

IQMLHaCFVVITLIMGVCOTrrARTLMKMP 

NKIS 


183 


1533 


A 


2140 


3 


561 


RQAWHEAFKVRKEILTVICCLLAFCIGLIFVQ 
RSGNYFVTMFDDYSATLFLLrVVILE>aAVCF 
VYGIDKFMEDLKDMLGFAPSRYYYYMWKYI 
SPLMLLSLUASWNMGLSPPGYNAWIEDKAS 
£iCrLf01Jrl WOljAVLiAolJJVrAlLir VrVArlviK 
RFSLIDDGAGPFCSAAYTTTGCRTPYL 


184 


1534 


A 


2145 


3 


538 


HELTVAAADRGQPPQSSWPVTVTVLDVND 

NPPVFTRASYRVTVPEDTPVGAELLHVEASD 

ADPGPHGLVRnvSSGDPSCaUFELDESSGTLR 

LAHALDCETQARHQLWQAADPAGAHFALA 

PVTIEVQDVNDHGPAFPLNLLSTSVAENQPPG 

TLVTTLHAIDGDAGAFGRLRYHL 


185


1535


A


213) 


2 


671 


LDKLLDRMENYNilTMiiYILKQVAATYIKLGW 

PKNNFNGSLVQASYQHEELRREVIMLACSFG 

NKHCHQQ/^STUSDWISSNRNRIPLNVRDIVY 

CTGVSLLDEDVWEFIWMKFHSTTAVSEKKIL 

LEALTCSDDRNLLNRLLNLSLNSEWLDQDAI 

DVIIHVARNPHGiU)LAwKEFRDKWKILNTRI 

RQKTLEFDFAEPLILAFPnLYTAIDICPLVREH 

E 


186


1536


A


il5i 


2 


400 


OKML.JJKHS/U'AIlKJ'HACiJ'lUlflVHI'LWJSTWA 
HLALPDAQDILYTLEDNRNWVDSMIPQSPSPP 
LDEQNRDWQGLLENLHVELTLDEEDSEGPEK 
EGEOQTYFTSSKTLCGIVPQNTDSLGETGIHIC 
AHDKSP 


187 


1537 


A 


2158 


227 


442 


FNCFRVASDSFLENSSLLIMILPLRNATQEFIIR 

PGAVAYTCNPSTLGGWGGWrrRSGVRDQPG 

QHGGTPS 


188 


1538 


A 


2167 


3 


486 


AHLGGAWLTQRSLGSWAAPGPARAAKEWA 

CBPQNQKMNIWRMKTSKHLQLLSFVLGAVSP 

AVWPYMMVLQENGYGVEEGIPTLLMAASS 

MDDILAITGrNTCXSIVFSSGCARSSGSKNSKS 

LRTPLGTICEGCDDSSIFSHLDHSSKWSSTyG 

HSGA 


189 


1539 


A 


2168 


2 


412 


EFLSSNQTTQLFNTTFRPMPNLRSVDLSYNXL 
(jAX^AriJLFUuUuuUi lUiMKANAIQFVPVRIF 
QDCRSLKFLDIGYNQLKSLAKNSFAGLFKLTE 
UILEHNDLVKVNFAHFPRLISLHSLCLRRNKV 
AIWSSLDW 


190 


1540 


A 


2179 


64 


399 


MRLNQNTLLLESFGXXRPYTSEHAPTYHQW 
MKADELLRWTTSEPLTLEHEYAMQRTWLED 
A I JiU 1 r 1 V JJJ AiuUCrlAv^rixA 1 liiboL.M V CrU VN 
LFLTDLEDLTLGEffiVLIAEP 


191


1541 


A 


9iQn 


1 


AAQ 


t~*\ T^Tl A A/^TDXTTTDXrV/TVTKlU'l'UTDITD^nrf AUTI 

\^ljJL/JKAAulKilJaKIN VIii^iclxlliuiKuWLiAKK 

LSYVLFIQERDVHKGMFATNVTENVLNSSRV 
QEAIAEVAAELNPDGSAQQQSKAVNKVKKK 
AKRBLQEMVATVSPAMIRLTGWVLLKLFNSF 
FWNIQIHKGQLEMVKAA7ETNLPLLFIJPVHR 
on 


192 


1542 


A 


2197 


26 


157 


PSKXGGIRLLLTGTQLYGRFGSAIAPLGDLDR 
DGYNGEGREEPY 


193 


1543 


A 


2236 


2 


383 


EYFPNSIWRSIJSTNIDLGDIGFYTYRILQALS 

VTIJOV/~'T\XlJOT*17VDf XTTT /^7C1J1>V7^%/T1 ATMlf 

Y IHo&uIMMKUVJU'J^NiLCNol'KNKLVlLAUW 
GLAEFYHPMRKYSVHVATRYYKSPEILLDYE 
YYDYSLDIWAVGVILLELLTLKLHVFEGGDN 
EQ 


194 


1544 


A 


2241 


105 


409 


RKGVGKMPTSEGRPGQERSDWVTSYKVMGS 
NDSHTWVTVKNGS(H)MIFEGNSEKEIPVLNE 
LPVPMGARYIRINPQSWFDNOSICMRMEILGC 
PLPDFNNY 


195 


1545 


A 


2245 


1 


672 


MGV/\SDWTKRIEYQPGSGSMPLFPSIHLBTCD 
GAVSSUJIVTELQTNYIGKGCDRETYSEKSLQ 
KLCGASSGUDLLPSPSAATOWTAGLLVDSSB 

MIFKFDGRQGAKlPDGIVPKNLTDQFnTMW 

MKHGPSPGVRAEKETILCYSDKTEMNRHHY 

ALi I VHivCKLVrijlJvlUJrJLMiAIjn'RPAEFHW 

KLDQQALAKVDGQPGKSITRQLQHVIPVTIQG 

ISLKPS 


196


1546 


A 


2256 


1 


396 


KRGTPVSGLTNWmAVnUIFREPIRLKTVKP 
GKVINKDLRHYLSLQFQKGSIDHKLQQVIRD 
NLYUITIPCTTRAPRDGEVPGVDYNFISVEQF 
KALEESQALLESGTYDGNFY GTPKPPAEPSPF 
QPDPV 


197 


1547 


A 


2259 


43 


594 


QLAIEIGVRALLFGVFVh'iKFLDPFQRVIQPEEl 

WLYKNPLGQSDNIPTRLMFAISFLTPLAVICV 

VKimRTDKTElKEAFLAVSlAI.AU4GVCTNTI 

KLFVGRPRPDFFYRCFPDGVMNSHvlHCTGDP 

DLVSEGRKSFPSmSSFAFSGLGFTTFYLAGKL 

HCFOIESGRGKSWRLCAAILPL 


198 


1548 


A 


2275 


3 


404 


rCTlVVVIPRMLVDFLSESKTISLPECATQMFF 

FLGFASNNCFIMAAMSYDRYTAIHNPLQYHT 

UvmuaCLQKOvIMASmiVOFLFSLClIVTVFN 

LSLCDLNTIQHYFCDISPVVSLACNYTFYHEM 

AlFVLSA 


199 


1549 


A 


2315 


1 


375 


LTXJMFFIHALSAIESTILLAMAFDRYVAICHPL 
RHAAVLNNTVTAQIGIVAWRGSLFFFPLPLLI 
KRLAFCHSNVLSHSYCVHQDVMKLAYADTL 
PNWYGLTAILLVMGXDRMFISLSYFLn 


200 


1550 


A 


2334 


2 


409 


PRVRPQQRKMSFFFKTELGEKLVTKFLFETDF 
SDDPMLPSPDQLKXKAPFTNKKLKAHQTPVD 
ILKQKAHQLASMQVQAYNGGNANPRPANNE 
EEEDEEDEYDYDYESLSDDNILEDRPENKSCH 
DQLQFEYKEEM 


201 


1551 


A 


2350 


3 


512 


ISWEAQIAEnQWVSDEKDARGYLQALASKM 
TEELEALRSSSLGSRTLDPLWKVRRSQKLDM 
SARLELQSALEAEKAKQLVQEELRKVKDAN 
LTl^KUaJSEAKKRELLEEMEILKKKMEEK 
FRADTGKLMLCDSALFEYKYFSNECFYFLFD 
LrVTLEAPTEFQIQY 


202 


1552 


A 


2351 


1 


1003 


PSSYSSDELSPGEPLTSPPWAPLGAPERPEHLL 
NRVLERLAGGATRDSAASDILLDDrVLTHSLF 
LPTEKFLQELHQYFVRAGGMEGPEGLGRKQA . 

A^MM T Til*! 1 VI"li*^^^^Y T *■ * 1 A Till w^ir *■ 

CLAMLLHFLurYQGLLQEEEGAGHlUUJLYL 
LIMKDESLYQGLREDTLRLHQLVETVELKIPE 
ENQPPSKQVKPLFRHFRRIDSCLQTRVAFRGS 
DEIFCRVYMPDHSYVTIRSRLSASVQD1LGSV 
TEKLQYSEEPAGREDSLILVAVSSSGEKVULQ 

HmjrV^yCT A ¥ /^¥VTC¥J¥ T7 a /"^TOT^CnTT? A¥ T7T*T fW5 

el illX-Vr 1 AlAjlNMiLFACJlKXJSYIiALVPLPE 

EIQVSPODTEIHRVEPEDVANHLTAFHWELFR 

CVHELEFVDYVFHOE 






203


1553


A




*> 




XTXTT XT<^ A 1?¥1T LI.' TX TOT rXTT7XnY~lf T/"n n^n T> T 

Nl^NCAEPlJpoQlWSLNvNFNTQKjCTVWU 

GYRPVGSIPLWLQNFVRILLNEEDMNVIWD 

WSRGATTFIYNRAVKNTRKVAVSLSVHIKNL 

LKHOASLDNFHFIGGSLOAHISGFVGKIFHGQ 

LGRTTGLDP 


204 


1554 


A 


2390 


280 


476 


SPSLLPQCLMSLSDLSLSPAPPSHLSPRCa'SPQ 

AGSRLGAMRRCAREMDATPMPPAPSCySERV 

T 


205 


1555 


A 


2400 


543 


745 


AAVAUUJISWQQPYPMDFYAOSSLOPWTVN 

HGQDRRPHAPGRPARGKVQEGSARPPSAVAC 

EDCSCR 


206 


1556 


A 


2406 


122 


485


DI^PDSREDHPQGfHRRLLPKRPVRGSLMPGH 
"IHHPCPVSSTTNiyrPDQIWVSVGSLRMGTGG 
MGANASTSPRCWDLSSGNKKWnQVPILASIV 
ESRGGLLATGVGGMCACVPRNQPLTGT 


207 


1557 


A 


2409 


289 


418 


LWTLYRHKQQVQHNHSNRLSCRPSQEDRAT 
HTIMVLDKENTLS 


208 


1558 


A 


2413 


64 


492 


VQGTGXXFIAFrEAMTHFPASPVWAGMFFL 

MLINI^LGSMIGTMAGITTPIIDTFKVPKEMFT 

GOCCVFAFLVGLLFVQRSGNYFVTMFDDYSA 

Tm.TLmLENIAVAWIYGTKKFMQELTE\IL 

GFRPYRFYFYMWKFVSP 


209 


1559 


A 


2417 


3 


877 


EKERLLDEWFTLDEVPKGKLHLRLEWLTLMP 

NASNLDKVLTDKADKDQANDGLSSALLILy 

LDSAKNLPIRYKTNEPVWEENFrFFIHNPKRQ 

DLEVEVRDEQHQCPLGNLKVPLSQLLTSEDM 

TVSQRFQLGNSGPNSTIKMKIALRVLHLEKRE 

RFPDHQHSAQVKRPSVSKEGRKTSnCSHMSG 

SPGPGGSNTAPSTPVIGGSDKPGMEEKAQPPE 

AGPQGLHDLGRSSSSLLASPGHISVKEPTPSIA 

SDBLPIATQELRQRLRQLENGTTLGQSFLGQI 

QLTIP 


210 


1560 


A 


2422 



35 


456 


REFAASDLEPFTPTDQPISreAlTQPSCIKRQRA 
AGNPGSLAATIDHKPCSAPLEPKIQASRNQRW 
GAVRAAESLTDIAEPASPQVHETPIDASQTQK. 
VEPASKSRFTPELQAKVSHSRERALSTMDATP 
HHAQPQROEO 


211 


1561 


A 


2431 


1 


764 


RRYSQKLIQHTACQLLRTYPAATRIDSSNPNP 

lAlFWLHGIQLVALNYQTDDULHLNAAMFE 

ANGGCGYVLKPPVLWDKNCa'MYQKFSPLER 

DLDSMDPAVYSLTIVSGQNVCPSNSMGSPCIE 

VDVIXJMPLDSCHFRTKPIHRNTLNPMWNEQF 

LFHVHEEDLVFLRFAWENNSSAVTAQRnPL 

KALKRGYRHLQLRNUINEVLEISSLFINSRRM 

EENSSGNTMSASSMFNTEERKC1,QTHRVTVH 

GVPG 


212 


1562 


A 


2436 


1 


411 


GIRGTTGHLGCPINDDPSLTLTVSWVMEDKPI 

YICS^GTKKEDDSLTIFAVAKRDHVSDTCOAC 

TOLDHNLDKGYLTVLGEQATPTKRLGALPKG 

RANRTRDLELTYLAERTVRLTWIPGDANNRPI 

TDYDCQIEEHQ 


213 


1563 


A 


2445 


1 


1294 


MSSIGCLWVSRSSQIDGLTAEKSGPEKPHGT 

WLMPELHPKEQBLELLVLEQFLSILPEaXJIWV 

QQHNPESGEESVnXEDLEREFDDPOQQVPAS 

PQQPAVPWKDLTCLRASQESTDIHLQPLKTQ 

LKSWKPCLSFKSDCENSETATKEGISEEKSQG 

LPQEPSERGISEHESNLVWKQGSATGEKLRSP 

SQGGSFSQVIFrNKSLGKRDLYDEAERCLILT 

TDSIMCQKVPPffiRPYRCDVCGHSFKQHSSLT 

QHQRIHTGKCPYKCNQCOKAFSLRSYLIIHQR 

fflSGEKAYECSECGKAFNQSSALIRHRKlHTO 

EKACKCNECGKAFSQSSYLIIHQRIHrGEKPY 

ECNECGKTFSQSSKLIRHQRIHTGERPYECNB 

CGKAFRQSSELmiQRIHSGEKPYECSECGKA 

FSLSSNLIRHQRIHSO 


214 


1564 


A 


2461 


1 


615 


GIPQSnSSSRNIFLEDDLAWQSUHPDSSNTPL 
STRLVSVQEDAGKSPARNRSASrrNLSLDRSG 
SPMVPSYETSVSPQANRTYVRTETTEDERKIL 
LDSVQLKDLWKHCHHSSGMEFQDHRYWLR 
THPNCIVGKELVNWLIRNGHIATRAQAIAIGQ 
AMVDGRWLDCVSHHDQLFRDEYALYRPLQV 
LFSWCQLECSKLIL 


215 


1565 


A 


2464 


3 


2932 


GPOVRSSQDGMADVFVHLRTAWPRCSFISGQ 

HOPGRHGRRVCSSQDSMADVFVHLRTAWPT 

CSUSGQHGPGESVSYEDDDIPAPASLLHVNA 

AAPALTNPTAPVLCTAPNNTAQKEKVPSGMR 

QRPAGVRISSRTPDLTCAVSTHSTVPGVRISSC 

TTOLTCAVSIHSTVPSVaSSCTPDLTCAVSTH 

STVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSmATVPGVRISSCTPDLTCAVSm 

ATVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSfflSTVPGVRISSCTPDLTCAVSIH 

ATVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHATVPGVRISSRTPDLTCAVSIH 

ATVPGVRISSCTPDLTCAVSIHATVPOVRISSC 

TPDLTCAVSIHATVPGYRISSRTPDLTCAVSIH 

ATVPGVRISSCTPDLTCAVSTHSTVPGVRISSR 

TPDLTCAVSIHATVPGVRISSCTPDLTCAVSTH 

STVPGVRISSRTPDLTCAVSIHATVPGVHISSC 

TPDLTCAVSTHSTVPGVRISSRTPDLTCAVSIH 

STVPGVCISSRTPDLTCAVSIHSTVPSVfflSSCT 

PDLTCAVSIHSTVPGVRISSRTPDLTCAVSTHS 

TVPGVmSSCTTOLTCAVSIHATVPGVHISSCT 

PDLTCAVSTHTTVPGVRISSRTPDLTCAVSIHS 

TVPGVRISSCTPDLTCAVSTHSTVPGVRISSRT 

PDLTCAVSTHLTVPGVRISSRTPDLTCAVSIHA 

TVPGVmSSCTPDLTCAVSIHATVPGVRISSRT 

PDLTCAVSIHATVPQVHISSCTPDLTCAVSTHS 

TVPGVRISSRTPDLTCAVSIHSTVPGVHISSCT 

PDLTCAVSTHSTVPGVHISSCTPDLTCAVSTH 

STVPGVHISSRTPDLTCAVSIHATVPSVfflSSC 

TPDLTCAVSIHSTVPGLLTSVSQTSTG 


216 


1566


A 


2477 


1 


414 


FRTKSYRKGSYRCIVSEWIAEQGNWQEIQEK 

AVEVATWIQPTVLRAAVPKNVSVAEGKELD 

LTCNITTDRADDVRPEVTWSFSRMPDSTLPGS 

RVLARLDRDFLVHSSPHVALSHVDARSYHLL 

VRDVSKENSGYYY 


217 


1567 


A 


2480 


2 


460 


CRTLCEGPQRFEBYEYLGYKAGLYE/UADHY 

MQVLVCQHECVRELATRPGRLSPIENIULHy 

DYLQFAyYRVGEYVKALECAKAYLLCHPDD 

EDVLDNVDYYESLLDDSIDPASIEAREDLTMF 

VKRHKLESELKSAAEGLGXSYTEPNYW 




218


1568


A 


2483 


140 


383 


AFSSPHPSPAPQFPEOGFYGLYDKILLFKHDPT 
SANLLQLVRSSODIQEGDLVBYVLSASATFED 
LQIRPHALTVHSYRAP 


219 


1569 


A 


2489 


3 


428 


SSRLVLLAGAAALASGSQGDREPVYRDCVLQ 
CEEQNCSGGALNHFRSRQPIYMSLAGWTCRD 
DCKYECMWVTVGLYLQEGHKVPQFHGKWP 
FSRFUFFQEPASAVASFLNQLASLVMLCRYRT 
FVPASSPMYHTCVAFAWVS 


220 


1570 


A 


2498 


1 


1297 


MDGEAVRFCTDNQCVSLHPQEVDSVAMAPA 

ArftJrKLVyAlrArMAV iL.VroL»V ILr VVUH 

HHFGREAEMREUQTFKGHMENSSAWWEIQ 

MLKCRVDNVNSQLQVLGDHLGNTNADIQMV 

KGVLKDATTLSLQTQMLRSSLEGTNAEIQRL 

KEDLEKADALTFQIINFLKSSLENTSIELHVL 

SRGLENANSEIQMLNASLETANTQAQLANSS 

LKNAKAEIYVIJRGHLDSVNDLRTQNQVLRNS 

LEGANAEIQGLKENLQNTNALNSQTQAFnCSS 
FDNTSAHQFLRGHLERAGDBIHVLKRDLKM 
VTAQTQKANGRLDQTDTQIQVFKSEMENVN 
TLNAQIQVLNGHMKNASREIQTLKQOMKNA 

0 AL. 1 ol^ J VMlJJoNIAiKAh AEIQRlJt.GDLENT 
KALTMEIQQEQSRLKTLHWITSQEQLQRTQ 


221 


1571 


A 


2501 


3 


500 


RVRL>WIXaLSPLMMAAKTGKIGIFQHIIRREV 

TDEDTRHLSRKFKDWAYGPVYSSLYDLSSLD 

TCOEEASVLEILVYNSKIENRHHkCLAVEPINE 

LLKUKWKKrUAVSryiNVvSYLCAMvil" 1 L.T 

AYYQPLEGTPFyPYRTTVDYLRLAGEVTTLFT 

GVLFFFTN 


222 


1572 


A 


2508 


3 


395 


DAHCQRKLAMQEFMEINERLTELHrQKQKL 

ARHVRDKEEEVDLVMQKVESLRQELRRTER 

AKJCEL£VHTEAIAAEASKDRKIJIEQSEHYSK 

QLENELEGLKQKQISYSPGVCSIEHQQEnKL 

KTDLEKKS 


223 


1573 


A 


2544 


2 


412 


NDPAnSNFSAAVVHTlVNhTLliSMTSLEVTK 

MVDERTDYLTKSLKEKTPFFSHCDQAVLQCS 

EASSNKDMFADRLSKSnKHSBWCSKSVIPNID 

KNAVYKESLPVSGEESQLTPEKSPKFPDSQNQ 

LTHCSLSAA 


224 


1574 


A 


2552 


401 


1 


GASLCFISTAh'lVUVLlDSCRFSYPERPnELSM 

CYNIYSUYTVRLTVGRERISCDFEEAAEPVLI 

QEGLKOTOCAnFLLNlYFFOMASSIWWVlLTX 

TWFLAAGLKWGHEAIEMHSSYFHIAAWAIPA 

VK 




225


1575


A 


2563 


724 


1 


MSARKERREKGEEEGEGEKDGDEDEKEEEKE 

GLGEEEEKEAGKKKKKQEEKEKEKGAVYSR 

VARICKNDMGGSQRVIJEKHWTSFLKARLNC 

SVPGDSFFYFDVLQSrronQINGIPTVVOVFTT 

QLNSIPGSAVCAFSMDDIEKVFKGRFKEQKTP 

DSVWTAVPEDKVPKPRPGCCAKHGLAEAYK 

TSIDFPDETLSFKSHPLMDSAVPPIADEPWFr 

KTRVRYRLTAISVDHSACaPYH 


226 


1576 


A 


2571 


449 


3 


EGVLFVYONYVGDVMNFEMAAEMAQEVAIP

TRTVLTTDDISSSPIEDRDGRRGVAGNFFIFKV

AGAACDRGMSLEACEAVTRKANBRTY™G

VALEPCSLPQTRRHNFEIGAEEMEIGMGIHCT

RGVIREKMMPADAIVDHIMDRIFS


227 


1577 


A 


2575 


3 


1197 


VLSDLCLFYYRDEKEEGILGSILLPSFQIALLTS 

EDHINRKYAFKAAHPmiIRTYYFCTDT<MJEM 

ELWMKAMLDAALVQTEPVKRVDKITSENAP 

TKETNNIPNHRVUKPEIQNNQKNKEMSKIEE 

KKALEAEKYGFQKDGQDRPLTKINSVKI^ISL 

r!sliIbaObAC,FAQTVHYRPINL5sSSENKIVN^ 

LADLRGGNRPNTGPLYTEADRVIQRTNSMQQ 

LEQWIKIQKGRGHEEETRGVISYQTLPRNMPS 

HRAQIMARYPEGYRTLPRNSKTRPESICSVTP 

C'i'Ul kvn f^o/^ AWVD'DCAiTDnrvT'Kjnirf^T vttvit 
o I niJK 1 JLAjyoAJu^lUCKiaMKlJiJIMWQL z£W 

QQRQFYNKQSTLPRHSTLSSPKIMVNISDQT 

MHSIPTSPSHGSIAAYQGYSPQRTYRSEVSSPI 

QRGDVTIDRRHRAHHPKVK 


228 


1578 


A 


2583 


3 


330 


LPFLGLGSVLPQGMVMASPEMNPnCSVFEA 
HrXTJJHATTFSRGFQVTVLVGNVRQTAVVE 
KIHAKVRGTWPFISPEVRKEGGLPQTGREIXD 
PTMGKPHLWWVAA 


229 


1579 


A 


2589 


1 


448 


DDKNAQGKRHVKPTSGNAFTICKYPCGKSR 
ECVAPNICKCKPGYIGSNCQTALCDPDCICNH 
GKCDCPNICQCLPGHGGATCDEEHCNPPCQH 
CKjltXA(»IIXrraTfGFVGPRCETMVCNRHC 
ENGGQCLTPDICQCKPGWYGPTCSTA 


230 


1580 


A 


2593 


2 


138 


AVTFSWFAYVADrrQEHERSMAYGLVCMFI 
LYLLYLLRNAFFLR 


231 


1581 


A 


2595 


185 


2 


SGPYTDKlPWFrEEQKLLEQALKTYPVNPPER 
WEKIAEAVPGRTKKACIKRYKVADLRISK 


232 


1582 


A 


2596 




391 


STVTGOPRRLLDTAGHOOPFl RI KTPAMFPOA 

ORARRRTPTCEPATPLCCRRDHYVNFQELGW 
RDWILLPEGYOLNYCSGOCPTHT Afi^frTAA^! 

FHSAVFSLLKANNPWPGRTSWCVPTARRPLS 
LLYL 


233 


1583 


A 


2601 


184 


403 


LLFSDEIIMAAPLRIADVTSGLIGGEDGRVYV 

YNGKETTLGDMTOKCKSWITPCPEEKVNVLQ 

NSIPYWERIT 


234 


1584 


A 


2614 


178 


335 


PLTLGLPENNKPPQADAVPDKELTLFVDSTTL 
DGSKSSDDQKnSYLWEKTQ 


235 


1585 


A 


2616 


2 


896 


DVLEVYGTGVASTRHEMGTLDKHKELEDLV 

AKFLNVEAAMVPGMGFATNSMNIPALVGKG 
( '1 TT DTWUMirrCf VI n a t>t i n a tpto i uy uvtv a 
v^LdijnJL/cviNJtil oJLVlrfUAJvLJjUAlliJirRrlNYA 

QSLEKLLRDAVIYGQPRTRRAWKKIULVEGV 

YSMEQSIVHLPQIIALKKKYKAYLYIDBAHSI 

GAVGPTGRGVTEFFGLDPHEVDVLMGTFTKS 

FGASGGYIAGRKARILSPPACLVPNTGSHSLH 

WDRADKFGDLVDYLRVHSHSAVYASSMSPPl 
AEQIIRSLKLIMGLDGTTQ 


236 


1586 


A 


2621 


1 


392 


NTSSFPAQPSSPARPSLPHLSQHPSNPLLPLAS 
ADHPQCGRFLPLHEPEPLCPSPSLSYPTLVSS 

KMLCIPPGRQQLROAQSMPGHGALSPLLLPP 
A 


237 


1587 


A 


2628 


398 


1 


DLVCKISGFGRGPRDRSEAVYTTMSGRSPAL 

WAAPBTLQFGHFSSASDVWSFGnMAVEVMAF 

GERPYWDMSGQDVDCAVEDGFRLPPPRNCPN 

LMHRLMLDCWQKDPGERPRFSQIHSILSKMV 

QDPEPPNV 


238 


1588 


A 


2631 


1 


1104 


WSPCSLTCGVGLQTRDVPCSHLLSREMNETV 
ILADELCRQPKPSTVQACNRFNCPPAWYPAQ 
WQPCSRTCGGGVQKREVLCKQRMADGSFLE 
LPETFCSASKPACQQACKKDDCPSEWLLSDW 

TLCPPLPFSSSIRPCMLATCARPGRPSTKHSPHI 

AAARKVYIQTRRQRKLHFVGGGFAYLLHKTA 

WLRCPARRVRKPLITWaaXXJHLISSTHVT 

VAPFGYLKIHRLKPSDAGVYrCSAGPAREHF 

VEKUGGNRKLVARPLSPRSEEEVLAGRKGGP 

KEALQTHKHQNGIFSNGSKAEKRGLAANPOS 

RYDDLVSRLLEQGAPCSSSKKKN 


239 


1589 


A 


2636 


1 


678 


MKPDNlLLDEHGHVHimFNIAAMl PBPTr»IT 

TMAGTKPYMAPEMFSSRKGAGYSFAVDWW 

SLGVTAYELLRGRRPYHIRSSTSSKEIVHTFET 

T\ArrYPSAWSQEMVSLLKKLLEPNPDQRFSQ 

LSDVQNFPYMNDINWDAVFQKRLIPGFIPNK 

GRIjg[CX)PTPELEEMILESKPLHKKKKRLAKK 

EKDMRKCDSSQTCLLQEHLDSVQKEFmNRE 

KVNRDa 


240 


1590 


A 


2639 


389 


3 


ELLDPITPMRTKCIELLYAALTSSSTDQPKAD 
LWQNFAREEEHVFTLYSKNIKKYKTCIRSKV 
ANLKNPRNSHLQQNLLSGTTSPREFAEMTVM 
EMANKELKQLRASYTESaQEHYLPQVIDGTL 
Y 


241 


1591 


A 


2640 


392 


3 


IRLTIUlCVITvIRLATICVLVFrLQSKrrSCDDD 
TCDLCXjYNQKLYPCWETQVGQEMYKLMIFD 
r HULA V IXJ" VIJFPjRlCLL Vl r CSSCXIJQC WGQ 
QEFAIPDNVLGIVYGQTICWIGAFFSPLLPAM 
Y 


242 


1592 


A 


2642 


405 


1 


YFKOTIXLLVGVICVAAAVEKWNLHKRIALR 
MVU^GAKPGMIXIX^FMCCrTLLSMWLSNT 
STTAMVMPTVEAVLQELVSAEDEQLVAGNSN 
TEEAEPISLDVKNSQPSVELIFVNEDILDFLMK 
SPUwnSQAa 


243 


1593 


A 


2646 


412 


2 


CLAMKOIQSSGKIIYFSSLFPYVVUCFLIRAF 

LLNGSnXjKHMFTPKLEIMLEPKVWREAATQ 

VITAimGFGGVlAFSSYNKKDNNCHFDAVL 

VSFINFFTSVLATLWFAVLGFKANVINEKCn' 

QNSETV 


244 


1594 


A 


2650 


1 


1271 


MTTTUGLUCTARLLRLVRVARKLDRYSEYG 

AAVLMLLMC3FAUAHWLACIWYA1GNVERP 

YLTDKIGWLDSLGQQIGKRYNDSDSSSGPSIK 

DKYVTALYFTFSSLTSVOFGNVSFNTNSEKIF 

SICVMUGSLMYASffGNVSAnQRLYSGTARY 

HMQMLRVKEFIRFHQIPNPLRQRLEEYFQHA 

WTYTNGIDMNMVTNGTCSSCTSDDGHFILVS 

NHHQQGLIYSWNDAASMQRPFNHIKSSLLOS 

TSDSNLNKYSTINKIPQLTLNFSEVKTIEKKNSS 

PPSSDKXnAPKVKDRTHNVTEKVTQVLSLGA 

D\a.PEYKl^APRINKrnLHYSPFKAVWDm.I 

LIXVIYTAIFTPYSAAFLLNDREEQKRRECGY 

SCSPLNV^LIVDIMFIlDILINFRTrYVNQNEE 

WSDPASV 


245 


1595 


A 


2656 


385 


2 


NLTWWPLFRDVSFYTVDLIMLIIFFLDNVIMW 

WESLX1X.TAYFCYVVFMKFNVQVEK.WVKQ 

kflNIOTKVVKVTAPEAQAKPSAARDKDEPTLP 

AKPRLQRGGSSASLHNSLMRHSIFQNKIHTLD 

PHV 


246 


1596 


A 


2660 


200 


506 


VLVLQMNYYQAfflJIYYVLFFKVNEFLAFEGPI 
LUJMRDCHLIKTNQLSQATALAKLCSDHPEIG 
nCGSFKQTYLVCLCTSSPNGKLIEEVSMFSFIS 
NYFLS 


247




A 
A 


to 


3 




DAWVKNDIIFNQTeRKQKISENIJCHLASVRV 
VQKNLVFWGLSQRLADPEVSPLVFFVILIFF 
VSLSYLEHEDPAQLCDSSEHnS 


248 


1598 


A 


2687 


1 


404 


DFlTlJWUi4MRTlJPSLFGDVRSDVHRFSVTLF' 
GAAIKSVKNPDKKSIENQVLDSLVPLLLYSQD 
iiNUAVAiiKaKQ VL. 1 JtJAQrlJS.WKLrREVYSK 
DPWHKPTEAOTICRFFEKKCKGKINILEQTL 
MYSKNPKL 


249 


1599 


A 


2692 


1 


440 


FRRRRRRRERDCAAQGARRHCRHLAECKLV 
SFPIGIYKVLRNVSGQIHLrrLANNELKSLTSK 
tMi 1 i" aQLK hi .H 1 .r.ONFLHRLFSEVS ALQHLK 
AEDLSKNOFODFPEOLTALPAl ETTNl PPNFIV 
DVPVEKLAAMPALRSINL 


250


1600 


A 


2693 


459 


21 


LLPGSLGVPILHSQPWDPSPQCPHRAPSTPRRL 
PPLGALSQALmSRAAKNHSQDPGKGTKPFP 
AAPAAPPPRSSLPAPLPMGLKDKGPQPAPPTTF 
NSPWHPATLPGALOPQLSQAAPSPIPPPCLMO 
ISSCPDLKLTKSSTP 


251


1601 


A 


2694 


2 


404 


FVFDLKLRVPGFAALUHGASSVPGPETVRLR 
QKMCKKAPDHSSGRKEEL>nTHTVDKLETKK 

r V Kits. V i.*VrfUJLfoVjrcu^xlol.JJUi^JKJuv i cJvKAlfOuri 

RKAGFPEHPVAPEPLSNSCQISKEGREQVLSEI 
OAGDCL 


252 


1602 


A 


2697 


421 


1 


PQKSHSGAYQCTATRKAQTAQDFAIIALEIXJ 
1 j'KJ Voorolilv V VWrvjJtiv^r bLMCAAKvjAPFPr 
VTWAU5DEPrVRDGSHRIKQYTMSDGrnSH 

NVROPPSIRAMRlSirr 


253 


1603 


A 


2698 


65 


401 


ACCXJWRRTLIPAKSTTVSCnSTPHHPFRGSYS 
FDDHTTDSEALSRSSHVFrSHPRMLKRQPAIEL 
PLGGHYSSDVPRPLSTQLSSSLLGYFSTLMTG 
AAFTNNlASSniL 


254 


1604 


A 


2699 


438 


301 


GQMSQDDPPFIDQLGFGVAPGFQTWACQEQ 
RVRGPWEAGPGVGY 


255 


1605 


A 


2700 


1 


842 


LQNRBDSSEGKKKLVEAEELEEKHREAQVS

AQHLEVHLKQKEQHYEEKIKVLDNQIKKDLA

DKETLBNMMQRHEEEAHEKGKILSEQKAMIN

AMDSKIRSLEQRIVELSEANKLAANSSLFTQR

NMKAQBEMISELRQQKFYLETQAGKLEAQN

RKLEEQLEKISHQDHSDKNRLLELBTRLREVS

LEHEEQKLELKRQLTELQLSLQERESQLTALQ

AARAALESQLRQAKTELEETTAEAEEEIQALT

VGLGSNIFRLLKASARMSVELALSILAHP


256 


1606 


A 


2701 


2 


405 


FVGGPGADPPVAVMWDPRAARMDLTAYAB
LLKESGNQVLKNGNFSLAIRKYDEAIQILLQL

YQWGVJtTKIJLAVIXCMKSNAFFSLGKWNEA

FVAAKECLQWDPTYVKGYYRAGYSLLRLHQ
PYEAARMFFEGLR


257 


1607 


A 


2702 


2 


399 


FVESASSRPPGCFSGDGRFWLVSEGSRRGWD
FNPSFSFLDPRYSVGODENIOTVTTLANILREF
NPSLKGFSVGTGKETSPNAFLNQAVAGGRAE

DIJVQARRLVIM.MKiroTRIHFQEDWKirrLFI

GGNDL


258 


1608 


A 


2709 


1 


1097 


SVGARQGEARDRIRRFFPKGDUEVLQAQVERI 

MTRKELLTVYSSEDOSEH'ETIVLKALVKACG 

SSEASAYLDHLRLAYAWNRVDLVQSELFRGDl 

QWRSFHLEASUiiDALLNDRPEFVRLLlSHGLS 

iATHrl.,llrMKl^QLYaAArSNSlJRNLLDQASH 

SAOTKAPALKGGAAELRPPDVGHVLRMLLG 

KMCAPRYPSGGAWDPHPGQGFGESMYLLSD 

KATSPLSLDAOLGQAPWSDLLLWALLLNRA 

QMAMYFWEMGSNAVSSALGACXJXRVMAR 

SSEVRAARLLLRRCPLWGDATCLQLAMQAD 
ARAFFAQDGVQSLPTQKWWGDMARR 


259 


1609 


A 


2721 


1 


403 


vyloagpglffsneoakbgekanipklmlpr 
gofsqremvtghrspspeeeeeeeeegfgera 
scrrgufrvrltrvglaapskasrgqegdaa 

gglrvrlp 


260 


1610 


A 


2728 


1 


477 


LLOGDLRYHLQQNVHtTEO'l'VKLYlCELALA 
LEYLQRYHnHRDKPDNILLDEHGHVHrrDFN 
lATWKGAERASSMAGTKPYMAPBVFQVYM 
DRGPGYSYPVDWWSLGITAYELLRGWRPYEI 
HSVTPIDEILNMFICVERVHYSSTWCKGMVAL 
LRK 


261 


1611 


A 


2730 


3 


547 


LTTTDFILVLYRYYRSPLVQIYEIEQHKIETWR
EIYLQGCFKPLVSISPNDSLFEAVYIIJKNRIH



RLPVLDPVSGNVlJffl.THKRLLKFLHIFGSLLP 
RPSFLYRTIQDLGIGTFRDLAWLETAFILTAL 
DIFVDRRVSALAWNEOGTHPQDERLOLQW 
GLGEPGSEERLFPAAITSR


262 


1612 


A 


2733 


3 


431 


GPEFPGSAKLVFLDLSYNNLTQLGAGAFRSA 

GRLVKI^IJVNNNLVGVHEDAFETLESLQVLE 

LNDNNLRSLSVAALAALPALRSLRLDGNPWL 

CDCDFAHLFSWIQENASKLPKGLDEIQCSLPM 

ESRRJSLRACRRPASRV 


263 


1613 


A 


2736 


2 


343 


PARISGVDPPVRKATKGGENCSFH5NKNWQF 
LWGLNGNROTKEPWGGRNNHAKGFRTTW 
ARSSSQNNRTFQNNRNFUILQRDSQKKGQFA 
RUSPLVNLPQSPGGLEFQYQAT 


264 


1614 


A 


2738 


2 


243 


RAMLKCLREGQPPPSYNWTRLDGPLPSGVRV 
DGDTLGFPPLTTEHSGiyVRHDTNEFSSRDSH 
DTVDVLDPPEDSGKQVDL 


265


1615 


A 


2752 


2 


388 


AAGDAPLRSLEQANRTRPPFFSDVKGDHRLV 
LAAVETm,VLIFAVSLLGNVCALVLVARRR 
RRGATACLVLNLFCADLLFISAIPLVLAVRWT 
EAWLLGPVACHLLI'YVMTLSGSVTILTLAAV 
SLER 


266 


1616 


A 


2755 


192 


1 


AFREVGGYWGLLCEHLYAIPSKTSEGNWTAK 

LQGYLPLQDAFHIFQDPLTGDLPWPEiaLGLP 

V 


267 


1617 


A 


2760 


434 


714 


ASRLEKQNSTPESDYDNlKNDMiiPDGMGYM 
HRTSVPGEGLPRARDLAGLGQQKQFTTHTPF 
LYFQTHKGLKDSSIRSEVTCLGISQCWRKGFF 


268 


1618 


A 


2762 


1 


405 


lAClFCGQDEWSPERS'l'RCl-'RRRSRFLAWGEP 

AVIXLLLLLSLALGLVLAALGLJFVHHRDSPL 

VQASGGPLACFGLVdGLVCLSVLLFPGQPSP 

ARCLAQQPLSHLPLTGCLSTLFLQAAEIFVESE 

LPLSWAE 


269 


1619 


A 


2772 


3 


243 


TRPAEKIQYLVLFFVMSHPSQAYDKLSLSDHL 
LLVVLNLLRREVSBHGRHLQQYFNUFVMYAN 
LSKNLSFSEFCFDVSY 


270 


1620 


A 


2789 


1 


486 


ELQSQQACTHTKETEQLRSQLQTIJCQQHQQA 

VEQIAKAEETHSSLSQHXJARLQTVTREKEEL 

LQLSIERGKVLQNKQAEICQLEEKLEIANEIMl 

KHALERFEQEAVAVDSNLRVRELQRKVDGIQ 

KAYDELRLQSEAFKKHSLDLLSKERELNGKL 

RHLSP 


271 


1621 


A 


2795 


1 


568 


KEKRVTVQLFTESIQKNQEDKLXMVPRKQRE 

FSGSDRGKLPGSEma^QGPSKaGRKEERLITB 

RKHEHLKNKSAPKVVKQKVIDAHLDSQTQN 

FQQTQIQTAESKAEHKKLPQPYNSLQEEKCLE 

VKGIQEKQVFSNTKDSKQEITQNKSFFSSVKE 

SQRDDGKGALNIVEFLRKREELHQILSTVKQP 


272 


1622 


A 


2797 


8 


523 


KCMQGKYAGAMESEPCVCTEADFDCDYtiYE 

RHSNGQCLPAFWFNPSSLSKDCSLGQSYLNST 

GYRKVVSNNCTOGVREQYTAKPQKCPOKAP 

RGLRIVTADGKLTAEQGHNVTLMVQLEEGD 

VQRTLIQVDFGDGIAVSYVNLSSMEDGIXHV 

YQNXGDCRXTVQVDNSLGS 


273 


1623 


A 


2801 


72 


395 


HPSRSNVGPRQLTVWNlJiNtSHUNRRKYIFS 
DEEGQNQLGIRIHQDIPLPPRRRELPALRTTNG 
KADSLNVSRNSVMQELSELEKQIQVIRQELQL 
AVSRKTELEEYH 


274 


1624


A 


2805 


168 


320 


ILWLYFKIOTWVYPVFAKLSLLGLAALFSLRE 
IFIARNGWGETLTHCKRV 


275 


1625


A 


2812 


208 


321 


GSLATCQLSEPLLWFILRVLDTSDALKAFHD 
MGKIIFQ 


276 


1626 


A 


2813 


41 


266 


AORSLHOAGDRAWVGISPTDWSPKWELCK 
KYQQQTWAIDLAGDETIPGSSLLPGHVQAY 
QVGPVRRNGEAGPG 


277 


1627 


A 


2817 


3 


410 


VLQERLDNFQRKCIQLASSTEGKVDKLLMRN 

LFISYLHTHODCQHEVLQAMGSILGITOEEME 

PLFQEEHGTATRWNfTGWI-EGGSKSVPKTPL 

GLNQQPALNGSFSELFVKFLKTEa^STLPTX 

LRPHNSPGKIK. 


278 


1628 


A 


2821 


238 


457 


GLSGPSCSCPHSPLPTUSRAQLETALKWRNYE 
VKlJUXLHLEELOMEHDIRIlYDLESVPNfTWD 
PVDQNPRLV 


279 


1629 


A 


2822 


342 


1 


PUPANLPAHSNPLQPLPSLPHPFLPATHKFPT 
TPPTFSSVPreLPSLSSILHHSPLHSELNPHLQS 
CRLPSRPSVSRELPPOSOPASSVPI^PTPI PDS 
VPSQRHPTXPPPAS 


280 


1630 


A 


2825 


307 


77 


PSMVWSYHWGVKQKRLALCVFSFEEGGRMC 
CGQYWPLEKDSRIRFGFLTVTNLTGAVGEPG 


281 


1631 


A 


2827 


81 


381 


KMGTAVWVPKEKEKRDKASQEGGDVLGAR 

QDCTPSLKSLVATGNLLDLEETAKAPLSTVSA 

NITNMDEVPRPQALSGSSWWVSOCVASRS 

VTT T^fi 


282 


1632 


A 


2830 


471 


160 


KLPXDKYELEPSPLTQYILERKSPHTCWQVFV 
TSSGKYNELGYPFGYLKASTTLTCVNLFVMP 
YNYPVIXPLLDDLFKVHKLKPNLKWRQAFDS 
YLKTLPPyVL 


283 


1633 


A 






148 


MKHYLHSQACSVFMYHLSPRTFPRYPGLMVP 
PLQCQMHPEESTQFSIKLQPPPVGRKNRERVE 
SSEESAP 


284 


1634 


A 


2836 


2 


384 


KTIJRTLLDILADG'nLKVGVGCSEDASKLLQ 
DYGLWRGCLDLRYLAMRQRNNLLCNGLSL 
KSLAETVLNFPLDKSLLLRCSNWDAETLTED 
QVIYAARDAQISVALFLHLLGYPFSRNSPGEK 
KR 


285 


1635 


A 


2843 


20 


271 


PIRPYYSYSGLDRDCSWLPLAKAWLPDVMIL 
VCDRVSEDGINRQQAQEWCIKHGFELVELSP 
EHJPEEDGKCLCVRRKYGTYl 


286 


1636 


A 


2845 


197 


278 


TAEDVLTVAYEHGVNLFDTAEVYAAGK 


287 


1637 


A 


2851 


2 


427 


FVAEVRREWAKYMEVHEKASFTNSELHRAM 
NLHVGNLRLLSGPLDQVRAALPTPALSPKDK 
AVLQNLKRILAKVQEMRDQRVSLEQQLRELI 
QKDDirGSLVrrDHSQMKKLFffiQlJaCYDQL 
KVYLEQNLAAQDRVLCALT 


288 


1638 


A 


2859 


2 


469 


FVNLGILTCIECSGIHREMGAHISRIQSLBLDK 

LGTSELLPAKNVGNNSFNDIMEAMLPSPSPKP 

TPSSDNfTVRKEYITAKYVDHRFSRKTCSTSSA 

KLNELLEADCSRDLLALIQVYAEGVELMEPLL 

EPGQELAETALHLAVRTADQTSLHLVE 


289 


1639 


A 


2861 


2 


454 


FVASGGPATARMSDSQFFCVAEERSGHCAW 

DGNFLYVWGGYVSIEDNEVYLPNDEIWTYDI 

DSOLWRMHLMEGELPASMSGSCXIACINGKL 

YIFGGYDDKGYSNRLYFVNLRTRDETYIWEK 

ITDFEGQPPTPRDKLSCWVYKDRIJYFG 


290 


1640 


A 


2868 


1 


378 


FRQGQLYKVFLHGSQGQVYHSQQVGPPGSAI 
SPDLLLDSSGSHLYVLTAHQVDRIPVAACPQF 
PDCAS<XQAQDPLOGWCVLQGRCTRKGQC» 
ragqlnqwlwsyeedshclhiqsllpghhpr; 

QE 


291 


1641 


A 


2870 


1 


385 


FRYKIPNNRQQLLRKRHIGNDIVTIVFQEPGAL 

PFTPKSIRSHFQHVFVIVKVHNPCrENVCYSV 

GVSRSKDVPPFGPPIPKGVTO'KSAVFRDFLL 

AKVINAENAAHKSEKFRAMATRTRQEYUCD 

LA 


292 


1642 


A 


2877 


3 


188 


RKIWPPATTQSPESTMDTSLKKEKSAILDLYI 
RPPPAVPYSPRYVAVHCHOMLVSCWCHL 


293 


1643 


A 


2878 


1 


427 


REKEEEVEEEEDKWKETEKEAEQEKEEDSL 

GAGIWPDAAIPSGERTCGSEGSRSVLDLVNYF 

LSPEKLTAENRYYCESCASLQDAEKWELSQ 

QPCYULTLLRFSFDLRTMRRRKILDDVSIPLL 

LRLPLAGGRGQAYDL 


294 


1644 


A 


2879 


109 


245 


QIXCFaPRQTTUVYIl^nGMVIFITTLDLRYI 
nVFVTGGVLG 


295 


1645 


A 


2880 


3 


320 


LASSQHGILNNLSLLFSICKTCIRTMDHHCPRA 
NNCVOEQNHRFFCALHCKSKHFCIEFTLNTNF 
FNOOPGAEKSnDAPFSLQPFLQDSKYNTALS 
LSESISQ 


296 


1646 


A 


2892


209 


363 


SQYSHSLDYHLLQVTKNPFTLGDSSNPGQTE 
RLQEFSQKMDQVRGHWPVST 


297 


1647 


A 


2893 


8 


424 


SPXILXLDTFILLOIQDNILVLILATPPFMAGG 

KLYSTMGRFLRDRKNPACREMAWLLANLA 

QGDSLAARAIAVQKGSIGHLLGFLEDSLAAT 

QICJQSQASLLHMHNPPFEPTSVDMMRRACRA 

LLALAKVDDNHSEF 


298 


1648 


A 


2894 


310 


445 


FWIYFPSFFMTGYLPLGFEFAVEITYPESEGTS 
SGLLNASAQVNL 


299 


1649 


A 


2898 


1 


492 


KKAKNLTNYDLCSIFLGTSTLLVWVGVIRYL 
GYFQAYNVULTMQASLPKVLRFCACAGMIY 
LGYTFCXiWIVLGPyHDKFENLNTVAECLFSL 
VNGDDMFATFAQIQQKSDLVWLFSRLYLYSn 
SUrmiLSLFIALITDSYDTIKKFQQNGFFETD 
LQEF 


300 


1650 


A 


2901 


1 


445 


PVWWNSLNGASEVTFSVHVKDGGSFPKTDST 
TVTVIU'VNKADFPKVRAKEQTFMFPENQPVS 
SLVmTGSSLRGEPMSYYIASGNLGNTFQIDQ 
LTGQVSISQPLDFHGQKYWWIEARDGGVPP 
FSSYEKLDirVLDVNDNAPIF 


301 


1651 


A 


2902 


162 


433 


THFIC3LPLGYCFPLLX>KIM.QLPSGFNCNFDFLE 

EPCGWMYDHAKWLRTTWASSSSPNDRTFPG 

KPAVSEDMKELRPACSTYFNPRFPYKL 


302 


1652 


A 


2909 


2 


412 


OPQMUaCKIYFIWVTRSQCQFEWLADIMQEV 

EENDHQDLVSVfflYVTQLAEKFDLRTTMLYI 

CERHFQKVU^lSUT'GLRSrrHFGRPPFEPFFN 

SLQEVHPQVRKIGVFSCGPPQMTKNVEKACQ 

LVNRQDRAHFM 


303 


1653 


A 


2914 


291 


453 


XLNRWLCFFYSWSFGBLLYEMVTLOAPPYPE 
VPPTSILHILQRRKIMKRPSSCS 


304 


1654 


A 


2926 


179 


354 


PGVPSQAUIKAESLKKCLSVMEAKVKAQTAP 
NKDVQREIADLGEVGAASLPPSSGPGA 


305


1655 


A 


2938 


135 


438 


GMGYLHAKGnJHKDLKSKNVFYDNGKVVlt" 
DFOLFSISGVLQAGRREDKLRIQNGWLCHLA 
PEIIRQLSPDTEEDKLPFSKHSDVFALGTIWYE 
LHAREWP 


306 


1656 


A 


2944 


2 


329 


VRWNSCVNCSCAFGNGASLSTSLGESSGCLW 
EIGKWLSCSLLSFPSPLAVUITFCIVTVLGREA 
LTKGALWAVFLLAGSAIiCAEVTGVIWRQPE 
SKTKLSFKVSSSA 


307 


1657 


A 


2950 


2 


411 


NYLCIAKNSAGSAMGKTRLWQVPPVIENGr" 
PDLSTTEGSHAFLPCKARGSPEPNITWDKDGQ 
PVSGAEGKFTIQPSGEULVKNLEGQDAGTyT 
CTAENAVGRARWIVHLTILVLPVFTTLPGDRS 
LKLCjUKL wlk 


308 


1658 


A 


2951 


1 


407 


PTRPPRVRFDNEFDAESQRKRTTSVSKMERM 

DSSLPEHEEDEDKEAINGSGNAENRERHSBSS 

DWNDCTVPSYNQTNSSMDFRNYMMRDETLEP 

UTOWEMAYTDTGMnTIDHNTXTTrWLDP 

RLCKKAKAPEDC 


309 


1659 


A 


2954 


2 


179 


QDFLTLTLTHP'IXjIXYVGAREALFAFSMEALE 
LQGAVRGGAVOGSRACQRARPRGAVLO 


310 


1660 


A 


2959 


1 


419 


QDMMERAUDTFVGHDWEPGSYVQMFPYPC 
YTRDDFLFVIEHMMPLCMVISWVYSVAMTIQ 
mVAEKEHRLKEVMKTMGLNNAVHWVAWFI 
TGFVQLSISVTALTAILKYGQVLMHSHWnW 
LFLAVYAVATIMFCF 


311 


1661 


A 


2963 


3 


465 


MKPQMPGLGAPNGYGPGRGRAGVPGGPERR 

PWVPHlXPFSSPGYLGVMKAQKPGAGEC»fiC 

PQKPGLRGTLKPQKSGHGHENGPWPGPCNA 

RVAPMLLPRLPTPGVPSDKEGGWGLKSQPPS 

AVQNGKLPGHQPPNGYGPGAEPGFNGGLEPQ 

KI 


312 


1662 


A 


2967 


3 


405 


WLAQEWSPCrVTCGQGLRYRVYLCIDHRGM 

HTGGCSPKTKPHIKEECIVPTPCYKPKEKLPV 

EAKLPWFKQAQELEEGAAVSEEPSFIPEAWS 

ACTVTCGVGTQVRTVRCQVLLSFSQSVADLPI 

DECEGPKPA 


313 


1663 


A 


2969 


2 


430 


WADNCRQGYLDALRFLHIRGLTKEPVLWT 

LVSKEPPAPADGNWDAGCDQRRKGGLSLNW 

KVPHVQVKDVPNFEQLSPELEAALKKACTRD 

PSRWARFWHSGPGQVLTYLLLPCTLPFEYIYF 

RSRRLWWLPDVPADLWWMQ 


314 


1664 


A 


2971 


422 


33 


U3XSHKALQRLRPOWLAPIJPQLRALHLDHNB 
LDALGRGVFVNASGLRLLDLSSNTLRALGRH 
DLDGLGAT .KKl .1 J .FNNRLVHLDEHAFHGLRA 
LSHLYLGCNELASFSFDHLHOLSATHLLTLOL 
SSNRM 


315


1665


A 


2973 


1 


525 


rrVSTHASGSPFGLEPQSOWLWVRAALDREA 

QELYnXVMAVSGSKAELGQQTGTATVRVSI 

LNQNEHSPRLSEDPTFLAVAENQPPGTSVGRV 

FATDRDSGPNGRLTYSLQQLSEDSKAFRIHPQ 

TGEVTTLQTIJDREQQSSYQLLVQVQDGGSPP 

RSTTGTVHVAVLDLNDNT 


316 


1666 


A 


2978 


2 


400 


ELWELVSAGKSGraRNTyEVQYVTGNVPKA 

GTDANVYLTIYGEEYGDTGERPLKKSDKSNK 

FEQGQTDTFnYAIDLGALTKIRIRHDNTGNR 

AGWnjJRIDrroMNNEITYYFPCQR'WLAVEE 

DDGQLSRE 


317 




A 

A 






440 


VLNCQGRPlKi" VRINGDGQEVLYLAESDNVR 

LGCPYVLDPDDYGPNGLDIEWMQVNSNPAH 

HRENVFLSYQDKRINHGSLPHLQHRVRFAAS 

DPSQYDASINLMNLQVSDTATYECRVKKTTM 

ATRKVIVTVQARPAVPMCWTEGQ 


318 


1668 


A 


2995 


119 


414 


LPEKEPPIIRKSSSLKVTKC3LFTEQPKPIIILRFA 
ENYDARLUUDIANTLREQVQELFNKTYGKQ 
RRTPGEGHVAAVDREVAGFPVPAEGISGETIH 


319 


1669


A 


2999 


2 


332 


gffaytygrLvvvedlhsGaqqhws6hsA£i" 
STLALSHSAQVLASASGRSSTTAHCQIRVWD 
VSGGLCQHUFPHSTTVLALAFSPDDRLLVTL 
GDHDGRTLALWGTGHL 




320


1670


A 

A 


3000 


693 


322 


IDESTGLmVNyLDYETK.TSYMMNVSATDQA 
PPFNQGFCSVYITLLNELDEAVQFSNASYEAA 
ILENLALGTEIVRVQAYSIDNLNQITYRFDAY 
TSTQAKALFKIDArrVRGWGQGAPFFPI 


321 


1671 


A 


3001 


6 


383 


RIPRGKACXTVLORSTGELEOFASSRLPPQPC 
GWGQSSDLLSRIDLDELMKKDEPPLDFPDTIJE 
GFEYAFl^JEKGQLRHIKTGEPFVFNYREHLHR 
"WNQKRYEALGEIITKYVYELLEKDCNSKKVS 


322 


1672 


A 


3007 


192 


447 


ERVRNSLFPGRGDSQCACCPSSPVWVFLETGF 

LFPWLFLQYEVnCKAYMQGEVEFEDGENGK 

DGAASPRNVGHNIYILAHQLARH 


323


1673 


A 


3019 


18 


243 


KEIXFYHIJVNNINFFNTRYAKIHIPIIASVSEH 
QPrnVVSFFFDlJfflLVCrFPAGLWFCIKNIND 
ERVFGKRGF 


324 


1674 


A 


3020 


523 


797 


LCYFSARYHQRKIFOILYlFrLSAINRKEPNLFI 
YLFIFFEMESHSVTHAOVQRHNLNSLQPLPPG 
FKRFSCLCFLSSWNYRGAPPGPANF 


325 


1675 


A 


3022 


2 


156 


NDFLPLYFGWVLTKKSSbTLRKAGQVFLEEL 
GNHKAFKKELRQCRWQVGAL 


326 


1676 


A 


3023 


38 


172 


KMVRGSKKLISFFPOGPYGILAORDPSKGLAT 
FCLNKEALKDEFE 


327 


1677 


A 


3027 




385 


LTLEtLLLPAASELAHGKRLACCrVDHKLPEC 
GFYGLYDKDLLFKHDPTSANLLQLVRSSGDIQ 
EGDLVEWLSASATFEDFQIRPHALTVHSYRA 
PAFCDHCGEMLFGLVRQGLKCDGCGLNYHK 
RC 


328 


1678 


A 


3030 


13 


5^9 


It KFl ISCQRPGPGLAAGMLPYTVNFKVSART 

LTGALNAHNKAAVDWGWQGLIAYGCHSLV 

WIDSrrAQTLQVLEKHKADWKVKWAREN 

YHHNlGSPYCaLRLASADVNGKnVWDVAAGV 

AQCEIQEHAKPIQDVQWLWNQDASRDLLLAI 

HPPNYIVLWNADTX3TKLWKKSYADNILSFSF 

D 




1679 


A 


3038 


90 


744 


SVNLPPSLWPWEEAMDSTKSEPLKGSPEAED 

G»>aEYKKLVNPSQYRFEHLVTQMKWRLQEG 

RGEAVYQIGVEDNGLLVGLAEEEMRASLKTL 

HRMAEKVGADITVLREREVDYDSDMPRKITE 

VLVRKVPDNQQFLDLRVAVLGNVDSGKSTL 

LGVLTQGELDNGRGRARUJLFRHLHEIQSGR 

TSSISFEBLGFNSKGEVHGINGTQWGQTLRMG 

W 


330 


1680 


A 


3040 


3 


397 


LCSllXLLTIPSWVLSQrrLKESGPrLMKPTBr 

LTLTCTFSGFSLNTSGVOVAWIRQPPGKALE 

WLALIYWDDDKRYSPSLNDRLHAKDTSRNQ 

VVLWrNMGPVDTATYYCAQFARGARGSN 

WFDPWGQ 


331 


1681 


A 


3043 


3 


1509 


AGBRHEAPPTTSNRHRRQIDRGVTHLKISGLK 
MPRQIAroWVAONVYWTDSGRDVlEVAQMK 

ulMxK&. il^ouMlUlirilAl V VL/rLJvu 1 MYW5D 

WGNHPKIETAAMDGTLRETLVQDNIQWPTG 

LAVDYHNERLYWADAKLSVIGSIRLNGTDPI 

VAADSKRGLSHPFSIDVFEDYIYGVTYINNRV 

PKIHKFGHSPLVNLTGGLSHASDWLYHQHK 

QPEVTNPCDRKKCEWLCLLSPSGPVCTCa«NO 

KRLDNGTCVPVPSPITPPDAPRPGTCNLQCTN 

GGSCFLNARRQPKCJICXJPRYTGDKCELDQC 

WEHCRNGGTCAASPSGMPTCRCPTGFTGPKC 

TQQVCAGYCANNSTCTVNQGNQPQCRCLPG 

FLGDRCXJYRQCSGYCENFGTCQMAADGSRQ 

CRCTAYFEGSRCEVNKCSRCLEOACWNKQS 

GDVTCNCTDGRVAPSCLTCVGHCSNGGSCT 

MNSKMMPECXJCPPHMTGPRCEEHVFSCJQQP 

GfflASnJP 


332 


1682 


A 


3045 


3 


952 


nTlSNFHTQVNRTYCCGTYRAGPMRQISLVG 

AVDEEVGDYFPEFLDMLEESPFLKMTLPWGT 

LSSLRLQCRSQSDDGPIMWVRPGEQMIPTAD 

MPKSPFKKRRSMNEnCNLQYLPRTSEPREVLF 

EDRTRAHADHVGQGFDWQSTAAVGVLKAV 

QFGEWSDQPRTTKDVICFHAEDFTDVVQRLQ 

LDLHEPPVSQCVQWVDEAKLNQMRREGIRY 

ARIQLCDNDIYFIPRNVIHQFKTVSAVCSLAW 

HIRLKQYHPWEATQNTESNSNMDCGLTGKR 

ELEVDSQCVRIKTESEEACTEIQLLTTASSSFP 

PASE 


333 


1683 


A 


3046 


497 


167 


SACSTGPELPGRATRSLTRPANQKGCDGDRL 
YYDGCAMIAMNGSVFAQOSQFSLDDVEVLT 
AILDLEDVRSYRAEISSRNLAVSAPVDTCVG 
CSSKTWKVAPFVRAWWRP 


334 


1684 


A 


3053 


37 


276 


VriULEEQLNQLTEDNAELNNQNFYLSKQLD 
EASOANDEIVQLRSEVDHLRREITEREMQLTS 
QKQVRRVNKWRSLEDF 


335 


1685 


A 


3054 


2 


846 


WDAWGDWSDCSRTCGGGASYSLRRCLTGR 

NCEGQNIRYKTCSNHDCPPDAEDFRAQQCSA 

YNDVQYQGHYYEWLPRYNDPAAPCALKCH 

AQGQNLWELAPKVLDGTRCNTDSLDMaSG 

ICQAVGCDRQLGSNAKEDNCGVCAGDGSTC 

RLVRGQSKSHVSPEKREENVIAVPLGSRSVRI 

TVKQPAHUFlESKTLQGSKGEHSFNSPGVrVV 

ENTTVEPQRGSBRQTFKIPGPLMADFlFKniY 

TAAKDSWQFFFYQPISHQWRQTDFFPCTVT 

CGGG 


336 


1686 


A 


3058


54 


347 


WGKQEAGAHSDSCXXLHTPPRLTPAHSRKA 
LRNSRIVSQKDDVHVCIMCLRAIMNYQVSRG 
AWDWRLGSPACPHWGLHKLPRLWDPLSLYP 
VLCWGT 


337 


1687 


A 


3059 


2 


709 


ILTSLVELTRFETLTPRFSATVPPCWVEVQQE 

QQQRRHPQHLHQQHHGDAAQHTRTWKLQT 

DSNSWDEHVFELVLPKACMVGHVDFKFVLN 

SNTTNIPQIQVTLLKNKAPGLGKVNGLRLCaT 

LHJHKEDILCGFVWLASQLDLSGHAGMLTLT 

SPKLVKGMAGGKYRSFLIHVKAVNERGTHEl 

CNOGMRPWRLPSLKHQSNKGYSLASLLAK 

VAAGKEKSSNVKNENTSGTRK 


338 


1688 


A 


3060 


85 


384 


KAFYNYHVLELLQMLVTGGVSSQLEQHLDK 
DKVYGVADSCTSLLSGKNRCXLGLLSLHEnL 
SDVNPKNTFGQLFCGSLDLFGlLCVGLYRnDE 
EELNP 


339 


1689


A 


3063 


236 


362 


ChLCJLSGUFMVMllU'NVSRRl'OYVAFQNYV 
PSSVTTMLSWV 


340 


1690 


A 


3065 


3 


1249 


DLWQFTPLHEAASKNRVEVCSLLLSYGADPT 

LLNCHNKSAIDLAPTPQLKERLAYEFKGHSLL 

QAAREADVTIUKKHI^LEMVNFKHPQTHETA 

LHCAAASPYPKRKQICELLLRKGANINEKTKB 

FLTHJIVASEKAHNDWEWVKHEAKVNAL 

DNLOQTSLHRAAYCCTILQTCRLLLSYGCDPN 

nSLQGFTALQMGNBNVQQLLQEGISLGNSEA 

DRQLLEAAKAGDVETVKKLCTVQSVNCRDIE 

GRQSTTLHFAAGYNRVSWEYLLQHGADVH 

AKDKGGLVPLHNACSYGHYEVAHiVKHGA 

VVNVADLWKFTPLHEAAAKGKYEICKLLLQ 

HGADPTKKNMXJNTPLDLVKDGDTDIQDLLR 

GDAALLDAAKKGC3UARVKKLSSPDNVNCRD 

TQGRHSTPLHLAGK 


341 


1691 


A 


3070 


1 


547 


GVLIPSFQNQLFADILAGIESVTSEHNYQTLIA 

NYNYDRDSEEES\aNLLSYNlDGIILSEKYHn 

RTVKFLRSATIPVVELMDVQGERLEAIEVGFD 

NRQAAFDMVCTMLEKRVRHKILYLGSKDDT 

RDEQRYQGYCDAMMLHNLSPLRMNPRAISSI 

HLRMQLMRDALSANPDLDOVFCTN 


342 


1692 


A 


3073 


463 


3 


RINRCRKPSDADILVPGDTISLIGTTSUUDYNB 

IDDNRVTAEEVDILLREGEKLAPVMAKTRILR 

AYSGVRPLVASDDDPSGRNVSRGIVLXDHAE 

imGLlXJHITITGGKLMTyRLNlAEWATDAVC 

RKLGNTRPCTTADLAUGSQEPAKVP 


343 


1693 


A 


3075 


250


1 


LLIYLAIFAPVAMSALAGVKSVQQVRIRAAQS 

LGASRAQVLWFVILPGALPEILTGUUGLGVG 

WSTLVAAELIAA'IKGLGFM 


344 


1694 


A 


3076 


2 


138 


LYFDAYLQSLQVAAISTTCCLLIGYPLAWAV 
AHSKPSTRNILUX 


345 


1695 


A 


3078 


469 


3 


LKIRGQRIELGEIDRVMQALPDVEQAVTHAC 

VINQAAATGGDARQLVGYLVSQSGLPLDTSA 

U3AQLRETLPPHMVPVVLLQLPQLPUANGKL 

DRKALPLPELKAQAPGRAPKAGSETIIAAAFS 

SLLGCDVQDADADFFALGGHSLLAMKLAT 


346 


1696 


A 


3082 


404 


2 


QNTTSKDLDVRLDPQTVPIELEQLVLSFNHMI 

ERIEDVFTRQSNFSADIAHEIRTPITNLrrQTEI 

ALSQSRSQKELEDVLYSNLEELTRMAKMVSD 

MLFLAQADNNQLIPEKKMLNLAHEVGKVFD 

QFEALPE 


347 


1697 


A 


3084 


3 


340 


NHLIFKEAEISKLYTKVHPAYRTLLEKRQALE 
DEKAKLNGRVTAMPKTQQEIVRLTRDVESGQ 
QVYMQLLNKEQELKirEASTVGDVRrVDPAIT 
QPGVLKPKKGLIILGAI 


348


1698 


A 


3086 


723 


10 


TQAMVWQQKACAEDDP<^GRHWLHAArL 

YNIAAYPHLKGDDLAEQAQALSNRAYEEAA 

QRU-GTMRQMEFTVPGGAPITOFLHMPKGDG 

PFPTVLMCGGLDAMQTDYYSLYERYFAPRGl 

AMLTIDMPSVGFSSKWKLTQDSSLLHQHVLK. 

ALPNVPWVDHTRVAAFGFRFGANVAVRLAY 

LESPRLKAVACLGPWHITXSGLKCQQQVPE 

MYLDYLASRLGMHDASTKSSTRENH 




1699


A 


3Uo/ 


2 


249 


RJRSSDFErrLAGTPLHAAYLIGMTLICAGFSV 
GFOVAMSQALGPFSLRAGVASSTLGIAQVCG 
SSLWIWLAAWGIGAWNM 


350 


1700 


A 


3099 


3 


424 


EAPEATPQPSQPGPSSPISLSAEEENAEGEVSR 
ANTPDSDrTEKTEDSSVPETPDNERKASlSYFK 

KDTVnVSEPSEDEESQGLPTMARRNDDISELE 
DLSGMEDLK 


351 


1701 


A 


3108 


2 


404 


DCKNHHGYQLLHRRALFEKRTRLSDYALIFG 

MFGIWMVIETELSWGAYYKAPLYSLALKCL 

ISLFTIILLGLTIVYHAREIQLFMANYGADDWR 

SALTYEPIFLILLEALRGVIHATPCRVSLSLWD 

GIX)LP 


352 


1702 


A 
A 




341 


2 


AQLAEVCPPQTLL 1 rm^JSlSJ lAIAAEDCNPER 
VAGLHFFNPAPVMKLVEWSGLATAAEWE 
QLCELTLSWGKQPVRCHSTPGFIVNRVARPy 
YSEAWRALEEQVAAPEVI 


353 


1703 


A 


3111 


3 


188 


HFSLFRIAFAVFLTYMTVGLPLPVIHLFVHHEL 
GYGNTMVGUVGIQFLATVLTRGYAGRLA 


354 


1704 


A 


3116 


367 


225 


WQUTOJ^Gi'FLNlGE'rDTESCVNGWVYDRSS 
FHPSNMTEVRGLVFLS 


355


1705 


A 


3117 


101 


53 


VINLVYLISSPRPELKPVDKESEWMKFPDGF 

EKFSPPILQLDEVDFYYDPKHVIFSKLSVSADL 

ESRICWGENGAGKSTMLKLLLGM-VAPVRGI 

RHAHRNLKIGYFSQHHVGAAGmPSACGNL 

LGTQVFLGRPEEEY\RHQLGFGMOISGELGHA 

SSLPACLGGQKEAEVAFCSDGLLPCPNFLMLN 

DEPTN\HLGHGRAIEALGPCLQTISGVGVILVS 

HE*SALSRLVCRE\LWVC*GRSTSPF 


356 


1706 


A 


3121 


137 


466 


RGGRDWGEHNQRLEEHQARAWQGAMDAG 
AASREHARWQGTGLAPGTRVAVAPTCVQGL 
PQERSVCRPFFSSRWREOPVWALOAGAHOKP 
RWSGGVRCWRGGRWFTPAHH 


357 


1707 


A 


3124 


1249 


229 


MLEAPGPSDGCELSNPSASRVSCAG<^EVQ 

PGLYFGGAAAVAEPDHLREAGITAVLTVDSE 

EPSFKAGPGVEDLWRLFVPALDKPETDLLSH 

LDRCVAFIGQARAEGRAVLVHCHAGVSRSV 

AUTAFLMKTDQLPFEKAYEKLQILKPEAKMN 

EQFEWQUCLYQAMGYEVDTSSAIYKQYRLQ 

KVTEKYPELQNLPQELFAVDPTTVSQGLKDE 

VLYKCRKCRRSLFRSSSILDHREGSGPIAFAH 

KRMTPSSMLTTGRQAQCTSYFIEPVQWMESA 

LLGVMDGQLLCPKCSAKLGSPNWYGBQCSC 

ORWITPAFQIHKNRVDEMKILPVLGSQTGKI 


358 


1708 


A 


3127 


816 


139 


EVETLGPRTPGP/EAQSPTPGSCPGWQEPSPGP 

TPPP*LSGPGPQGAPVLGKIXPDPEETPAGKTP 

LGKHFWWGL\PVTSANFSPGAAA»FGGALSPP 

GGDL/GHMLLQGPPSPFRLQQQ*Qrn»PGSHSP 

PTANREINPOPAAAADTRSCWOHKRSWRGW 

RGLAPWRLGFGSPGIP*PAPAGIP/GRPTWEGG 

KGAGGKPSETLTRSPPVWRGKRGSANGFLSW 

VQILQ 


359 


1709 


A 


3132 


3 


191 


HEHLLXLLLCVFLVKSQGVNDNEEGFFSARG 
HRPLDKKREDAWLRPALADOTVCDYRAQIA 
*AASTPKRAASIAHNAVSCR*AQIA


360 


1710 


A 


3134 


1 


286 


REPPHPALLFF*DRVSLCCPGWNAWQSQLT 
AAPTSQVQ/SDSPTRPSSWDYRHVPEYPANEL 
*RQGFPMLPRLVSNSWAQTVHPPRPPKVLDL
QA 


361 


1711 


A 


3135 


56 


1449 


PVPAPRVSPSARGAPGRPRLPGVRGPRHSWA 

AD»RGSRM/PPRAPAPSPTGP/APGGKKVRGR 

VPEDPDAYEPRCSAL*V«PTHVTSPQFCDP*N 

GQIRSYTTVLIJU3LNETMLVK/PLCRREP/PEA 

GPGRQSTPAVTRDHRQHEDPROAGRQWDAD 

PVP^ A9/PA17VA'IV^C!171>m>UXini/kin3¥ A Af\r\ 

1 IvToAr/r/Ul VAl IjoKl^XriMWMKLCLAAQQ 

APGLPHRTSmPGWRRLTEPEAWARRHRRPW 

GQRGAVRPPPQGAAPPPSHQGRRTNTDPSAT 

PRLTVMSRCLAPDLKAPASGPRGWRRGMPQ 

SS/GAIXWTPPPTPRGSHSPRPREAPLRAIHPA 

OPSK/SRAGASGRLPEVIYOWVTLFIPPEAGT 

F/LIPSPT*MSPALVIQPPVPPT(SvlGLRISGIJ»R 

QG»PSGAPW*UPGLAQLAFQCMLPHDEVGPP 

RNQSPLGNDTLSSGLPMGPRRQVWPLARVG 
GHSSPREPQVLKKPLWGQTDIAGVGSASLYP 
DNL 


362 


1712 


A 


3136 


1270 


274 


RVGMVLGTREVGDSTPPPSPPLYPFTGNEFVQ 

HNTWQLSRVYPSDLRTDSSNYNPQELWNAG 

CX3M/V*GGSRDWEEGVEEQQVGNKFSSDGR 

VGECSRKLLG*EMLSVDrrSRYRAPSTyLLNS 

LKEGLEGLHGESCSSFLLGPSVAMNMQTAGL 

EMDICDGHFRQNGGCGYVLKPDFLRDIQSSF 

HPEKPISPFKAQTLLNQVISVQQLPKVDKTKE 

GSIVDPLVKVQIFGVRLDTARQETNYVENNG 

FNPYWGQTLCmVLGPDFPMLKFGKMDYDW 

KSRNDLLGKTPCPGTCMQQGYRHIHLLSKDa 

ISLRPASIFVYICIQEGLEODES 


363 


1713 


C 


3139 


60 


248 


MFAGSYGKSMFSFSKKVLNCLPKWRYHFVIA 
PAMNESPLAPHLHQHLVFSVFQVLTIUGV** 


364 


1714 


A 


3140 


57 


418 


SAFKTLQLPAFSLYFDLGSUCLXILRIHTSIVK 
NHKVESPRTMSPG*DPQSFLQIPQPRPPQLRV 
GLTSGUQHFHSPSSCQFPLLRGPPFPRQPPLGI 
SGASLCPVLSPPR»PLQPSSL 


365 


1715 


A 


3145 


122 


413 


1XPYPSLFVFLRQCHFVT\RLECNGVVSAHCN 
LHLPGSSDSPASAS^VAGTTGVCHHTRUFWF 
LV*TGFHYVAQAGLELLTA»S\PPQLPKWGL 
OA 


366 


1716 


A 


3150 


247 


2 


VGEKLHDERFGNDFDMTPKAQATKEKIDKLN 
FIKIKKLCIEGYY/NREPQNGRKIFANYVS\DK 
GLMATTYEELLKLSNKLIQ 


367 


1717 


A 


3152


3 


2367 


QKLKQNQPKRAHVEDGGSRSKQGNEQSKKT 

PIEKSDFAAATHPRAFYLSKPDETPNAWMSD 

SGTGLTYWKLEEKDMHHSLPETLEKTFISLSS 

TDVSPNQVLTLDPTLHMKPKQQISG1QPHGU» 

NALDDRISFSPDSVUEPSMSSPSDIDSFSQASN 

VTSQLPGFPKYPSHTKASPVDSWKNQTFQNE 

SRTSSTFPSVYTrrSNDISVNTVDEENTVMVAS 

ASVSQSQLPGTANSVPECISLTSLEDPVILSKIR 

QNLKEKHARHIADLRAYYESEINSLKQKLEA 

KEISGVEDWKITNQILVDRCGQLDSALHEATS 

RVRTLENKNNLLEIEVNDLRERFSAASSASKI 

LQERIEEMRTSSKEKDNTIIRLKSRLQDLEEAF 

ENAYKLSDDKEAQLKQENKMFQDLLGEYES 

LGKEHRRVKDALNTTENKLLDAYTQISDLKR 

MISKLEAQVKQVHffiNMLSLRHNSRIHVRPS 

RANTIj\TSDVSRRKWLIPGAEYSIFTGQPLDT 

QDSNVDNQLEETCSLGHRSPLEKDSSP/GSSST 

SliKKQRETSDTPttfllALKELDEGKIFKNWG 

TQTEKEDTSNSLL*/INPRQTETSVNASRSPEK 

CAQQRQKRLNSASQRSSSLPPSNRKSSTPTKR 

EIMLTPVTVAYSPKRSPKENLSPGFSHLLSKN 

ESSP1REKTYSEKA1DNHVNHSSCPEPVPNGV 

KKVSVRTAWEKNKSVSYEQCKPVSVTPQGN 

DFEYTAKIRTLAETERFFDELTKEBa)QIEAAL 

aKMrM'UVjKl 1 lAi 1 RL,NQVK.CLSLNLL 


368 


1718 


A 


3163 


2 


2350 


EFKSGGC3GAGLVAAGAVLVLYPASRAGERT 

RVPGSPAPSSLPLHSPGACGTEVDMDPQRSPL 

IJEVKGNIELK3«>LIKAPSQLPLSGSRLKRRPDQ 

MEDGLEPEKKRTRGLGATTKITTSHPRVPSLT 

TVPQTQGQTTAQKVSKKTGPRCSTAIATGLK 

NQKPVPAVPVQKSGTSGVPPMAGQKKFSKRP 

AWDLKOQLCDLNAELKRCRERTQTLDQENQ 
SEQ ID NO: of nucleotide sequence 

SEQ ID NO: of peptide sequence 

Method 

SEQ ID NO: in USSN 09/496914 

Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence 

Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence 

Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














QLQDQLRDAQQQVKALGTERTTLEGHLAKV 

QAQAEQGQQELKNLRACVLELEERLSTQEGL 

VQELQKKQVELQEERRGLMSQLEEKERRLQT 

SEAALSSSQAEVASLRQETVAQAALLTEREER 

LHGLEMERRRLHNQLQELKGNIRVFCRVRPV 

LPGEPTPPPGLLLFPSGPGGPSDPPTRLSLSRSD 

ERRGH.SGAPAPPTRHDFSFDRVFPPGSGQDE 

VFEEIAMLVQSALDGYPVaFAyGQTGSGKTF 

TMEGGPGGDPQLEGLIPRALRHLFSVAQELSG 

QGWTYSFVASYVEIYNETVRDLLATGTRKGQ 

UOb^JiiKJKAUrGSEELTVTNARyvPVSCEKEV 

DALLHLARQNRAVARTAQNERSSRSHSVFQL 

QISGEHSSRGLQCGAPLSLVDLAGSERLDPGL 

ALGPGERERLRETQAINSSLSTLOLVIMALSN 

KJsaMVriKrlaKl^TYLLQNSLuGSAKMLMFV 

NISM.EENVSESLNSLRFASICVEPSVLFGTAQS 

NRKWKTDPDLCVCVCVCVCVCVCVCVCVP 

MSMYRVRGGRVAGGCFIGWRAPCPRAIK 


369 


1719 






JO J 




Vj Y 1 J>yVrK W lUUiKUl'L 1 AKl JiSLHENNFNALP 
GYIRKIE*1*IYKKN*INFGGVGLLNIVKISILS/K 
lYRTOAIPVKlLTRFFINLDKLILKFVLKTKlAK 


370 


1720 


A 


3170 


393 


42 


GASISPSAVIDGVEGUa»MQEQEAQEAGPCLD 
*HMAPEQWVAPRVRLLFRLIFSVLHALnAAAA 
QSSAEEDEDPRN*GQSSEDQAPNQNGLrVIVH 
KVJtlVrl^oAAAl Vr VtiKoHrrK 


371 


1721 


A 


3173 


770 


310 


GNGGCGLSQIPPSHLGAFSRGSLLSRGXDPRGP 

PPHPVIFFVFWHQGFTVLARMVS1S*PCDPP 

ALASQSAGITGVSHLARPQNLYF 


372 


1722 


A 


3180 


381 


76 


RVLHHDNVPAHSS?QKRE1SQEFQLEIRHLP*S 
PDLAPSOCTU^i«JKJnnQGTHFSLVDNVKK 
TVSTWLH/SQNAQFYKDRLNGWYHCLQKCL 
QHY*AYVEK 


373 


1723 


A 


3181 


410 


14101 


RREVAGPEGKGLLLASAHTMLTPPLLLIXPLL 

SALVAAAIDAPKTCSPKQFACRDQITCISKGW 

RCDGERDCPDGSDEAPEICPQSKAQRCQPNE 

HNCIXJTELCVPMSRLCa^fGVQDCaWDGSDEGP 

HCRELQGNCSRU3CQHHCVPTLDGPTCYCNS 

SFQLQADOKTCKDFDECSVYGTCSQLCTNTD 

GSFIOGCVEGYLLQPDNRSCKAKNEPVDRPP 

VLLIANSQNILATYLSGAQVSnTPTSTRQTTA 

MDFSYANETVCWVHVGDSAAQTQLKCARM 

PGLKGFVDEHTTNISLSLHHVEQMAEDWLTGN 

nrFVDDIDDRlFVCNRNGDTCVTTXDLELyNP 

KGIALDPAMGKVFFTDYGQIPKVERCDMDG 

QNRTKLVDSKIVEPHGrrLDLVSRLVYWADA 

YLDYIEWDYEGKGRQTnQGIUEHLYGLTVF 

ENYLYATNSDNANAQQKTSVIRVNRFNSTEY 

QWTRYDKOGALHIYHQRRQPRVRSHACEN 

DQYGKPGGCSDICLLANSHKARTCRCRSGFS 

LGSDGKSCKKPEHELFLVYGKGRPGIIRGMD 

MGAKVPDEHMIPIENLMNPRALDFHAETGn 

YFADTTSYLIGRQKIDGTERETILKDQIHNVE 

GVAVDWMGDNLYWTDDGPKKTISVARLEK 

AAQTRKTLIEGKMTHPRAIVVDPLNGWMYW 

TDWEEDPKDSRRGRLERAWMDGSHRDIFVT 

SKTVLWPNGLSLDIPAGRLYWVDAFYDRIETI 

LLNGTDRKIVYEGPELNHAFGLCHHGNYLFW 

TEYRSGSVYRLERGVGGAPPTVTLLRSEVRPPI 














FEIRVMYDAQHQQVGSNKCRVNNAGCSSLCL 

ATPGSRQCACAEDQVLDADGVTCLANPSYVP 

PPQCQPGEFACANSRCIQERWKCDGDNDCLD 

NSDEAPALCHQHTCPSDRFKCENNRCIPNRW 

LOXjDNDCGNSEDESNATCSARTCPPNQFSC 

ASGRCIPISWTCDIDDDCGDRSDESASCAYPT 

CFPLTQFTCNNGRONINWRCDNDNDCGDNS 

DEAGCSHSCSSTQFKCNSGRCIPEHWTCDOD 

NDCGDYSDETHANCTNQATRPPGGCHTDEF 

QCRLDGLCIMJRWRCDGDTDCMDSSDEKSCE 

GVTHVCDPSVKFGCKDSARCISKAWVCDGD 

NDCEDNSDEENCESLACRPPSHPCANNTSVC 

LPPDiaaXjENDDCGDGSDEGELCDQCSLNN 

GGCSHNCSVAPGEGFVCSCPLGMELGPDNHT 

CQIQSYCAKHLKCSQKCDQNKFSVKCSCYEG 

WVLEPDGESCRSLDPFKPFIIFSNRHEIRRIDLH 

KGDYSVLVPGLRNTIALDFHLSQSALYWTDV 

VEDKIYRGKLLDNGALTSFEWIQYGLATPEG 

LAVDWIAGNIYWVESNLDQIEVAXLDGTLRT 

TLLAGDIEHPRAIALDPRDGILFWTDWDASLP 

RIEAASMSGAORRTVHRETGSGOWPNGLTV 

DYLEKRILWIDARSDAIYSARYDGSGHMEVL 

RGHEFLSHPFAVTLYGCTVYWTDWRTNTLA 

KANKWTGHNVTWQRTNTQPFDLQVYHPSR 

QPMAPNPCEANGGQGPCSHLCLINYNRTVSC 

ACPHLMKLHKDNTTCYEFKKFLLYARQMEIR 

GVDLDAPYYNYnSFTVPDIDNVTVLDYDARB 

QRVYWSDVRTQADCRAFINGTGVETWSADL 

PNAHGLAVDWVSRNLFWTSYDTNKKQINVA 

RLDGSFKNAWQGLEQPHGLWHPLRGKLY 

WTDGDNISMANMDGSNRTLLFSGQKGPVGL 

AIDFPESKLYWISSGNHTINRCNLDGSGLEVID 

AMRSQLGKATALAIMGDKLWWADQVSEKM 

GTCSKADGSGSVYLRNSTTLVMHMKVYDESI 

QLDHKGTNPCSVNNGDCSQLOLP'I SEIURSC 

MCTAGYSLRSGQQACEGVGSFLLYSVHBGIR 

GffLDPNDKSDALVPVSGTSLAVGIDEHAEND 

TIYWVDMGLSTISRAKRDQTWREDVVmGIG 

RVEGIAVDWIAGNIYWTDQGKDVIEVARLNG 

SFRYWISQOLDKPRAITVHPEKOYLFWTEW 

GQYPRIHISRLDGTERWLVNVSISWPNGISV 

DYQDGKLYWCDARTDKIERIDLETGENREW 

I^SNNMDMFSVSVFHOFIYWSDRTHANGSIK 

RGSKDNATDSVPLRTGIGVQLKDKVFNRDR 

QKOTOVCAVANGGCQQLCLYRORGQRACA 

CAHOMLAEDGASCREYAGYLLYSERTILKSI 

HLSDERNUMPVQPFEDPHIMKNVIALAFDY 

RAGTSPGTPNRlFFSDlHFGNIQQINDDGSRRrr 

IVENVGSVEGLAYHRGWDTLYWTSYTTSTIT 

RHTVDQTRPGAFERETVITMSGDDHPRAFVL 

DECQNIJMFWTNWNEQHPSIMRAALSGANVL 

TLIEKDIRTPNGLAIDHRAEKLYFSDATLDKIE 

WTDWVRRAVC^IANKHVGSNMKLLRVDIPQ 

QPMGIIAVANDTNSCELSPCRINNGGCQDLCL 

LTHQGHVNCSCRG<»ILQDDLTCRAVNSSCR 

AQDEFECANGEONFSLTCDGVPHCKDKSDE 

KPSYCNSRRCKKTFRQCSNGRCVSNMLWCN 

GADDCGDGSDECPCNKTACGVGEFRCRDGTC 

IGNSSRCNQFVDCEDASDEMNCSATDCSSYF 














RLGVKGVLFQPCERTSLCYAPSWVCDGAND 

CGDYSDERDCPGVKRPRCPLNYFAO'SGRCIP 

MSWTCDKEDDCEHGEDETHCNKFCSEAQFE 

CQNHRCISKQWLCDGSDDCGDGSDEAAHCE 

GKTCGPSSFSCPGTHVCVPERWLCDGDKDCA 

DGADESIAAGCLYNSTCDDREFMCQNRQCIP 

KHFVCDHDRDCADGSDESPECEYPTCGPSEF 

RCANGRCLSSRQWECDGENDCHDQSDEAPK 

NPHCrsraHKCNASSQFLCSSORCVAEALLCN 

GQDDCODSSDERQCHINECLSRKLSGCSQDC 

EDLKIGFKCRCRPGFRUKDDGRTCADVDECS 

TTFPCSQRCINTHGSYKCLCVEGYAPRGGDP 

HSCJCAVTDEEPFLIFANRYYLRKLNLDGSNY 

TLLKQGLNNAVALDFDYREQMIYWlDVTrQ 

GSMIRRMHLNGSNVQVLHRTGLSNPDGLAV 

DWVGGNLYWCDKGRDTIEVSKLNaAYRTVL 

VSSGLREPRALWDVQNGYLYWTDWGDHSL 

IGRIGMDGSSRSVrVDTKJTWPNGLTLDYVTE 

RIYWADAREDYDEFASLDGSNRHWLSQDIPH 

IFALTLFEDYVYWTDWETKSINRAHKITGTN 

KTLLISTLHRPMDLHVFHALRQPDVPNHPCK 

VNNGGCSNLCLLSPGGGHKCACPTNFYLGSD 

GRTCVSNCTASQFVCKNDKCIPFWWKCDTE 

DDCGDHSDEPPDCPEFKCaiPGQFQCSrGICTN 

PAFICDGDNDCQDNSDEANCDIHVCLPSQFK 

CTNTNRCIPGIFRCNGQDNCGDGEDERDCPE 

VTCAPNQFQCSmKRCIPRVWVCDRDNDCVD 

GSDEPANCTQMTCOVDEFRCKDSGRCIPARW 

KCDGEDDCGDGSDEPKEECDERTCEPYQFRC 

KNNRCVPGRWQC3)YDNDCGDNSDEESCXPR 

PCSESEFSC/WGRCIAGRWKCDGDHDCADGS 

DEKDCIPRCDMDQFQCKSGHCEPLRWRCDA 

DADCMDGSDEEACGTGVRTCP1J3EFQCNNT 

LCKPLAWKOXJEDDCGDNSDENPEECARFV 

CPPNRPFRCKNDRVCLWIGRQCaXJTDNCOD 

GTDEEDCEPPTAHTTHCKDKKEFLCRNQRCL 

SSSLRCmiFDDCGDGSDEEDCSIDPKJLTSCAT 

NASICGDEARCVRTEKAAYCACRSGFHTVPG 

QPGCQDINECLRFGTCSQLCNNTKGGHLCSC 

ARNFMKTHNTCKAEGSEYQVLYIADDNEIRS 

LFPGHPHSAYEQAFQGDESVRIDAMDVHVKA 

GRVYWTNWHTGTISYRSLPPAAPPTTSNRHR 

RQEDRGVTHLNISGLKMPRGIAIDWVAGNVY 

WTDSGRDVIEVAQMKGENRKTLISGMIDHW 

AIWDPLRGTMYWSDWGNHPKIETAAMDGT 

LRETLVQDNIQWPTGLAVDYHNERLYWADA 

KLSVIOSIRLNOTDPIVAADSKRGLSHPFSIDV 

FEDYIYGVTYINNRVFKIHKFGHSPLVNLTGG 

LSHASDWLYHQHKQPEVTNPCDRKKCEWL 

CLLSPSGPVCTCPNGKRLDNGTCVPVPSPTPP 

PDAPRPGTCNLQCFNGGSCFLNARRQPKCRC 

QPRYTGDKCELDQCWEHCRNGGTCAASPSO 

MPTCRCPTGFTGPKCTQQVCAGYCANNSTCT 

VNQGNQPQCRCLPGFLGDRCQYRQCSGYCE 

NFGTCQMAADGSRQCRCTAYFEGSRCEVNK 

CSRCLEGACWNKQSGDVTCNCTDGRVAPS 

CLTCVGHCSNGGSCIMNSKMMPECQCPPHM 

TGPRCEEHWSQQQPGfflASnjPLLLLUJLVL 

VAOWFWYKRRVQGAKGFQHQRMTNGAM 

NVBIGNPTlfKMYEGGEPDDVGGLLDADFAL 














DPDKPTNFTNPVYATLYMGGHGSRHSLASTD 
EKRELLGRGPEDEIGDPLA 


374 


1724 


A 


3187 


191 


1815 


C1£LASAGKIPEESKALSLLAPAPTMTSLMPG 

AGLLPEPTPNPLTTLGVSLSSLGAIPAAALDPNI 

ATLGEIPQPPLMGNVDPSKIDEIRRTVYVGNL 

NSQTTTADQLLEFFKQVGEVKFVRMAGDEr 

QPTRFAFVEFADQNSVPRALAFNGVMFGDRP 

LKINHSNNAIVKPPEMTPQAAAKELEEVMKR 

VREAQSFISAAIEPaWLHSTSLCNDFLQCF*RR 

RMYRE*APCTICGTFHLCLIINWDL»LF*AYTA 

K*FFPPRVWKEQ*KKRR\RSRSHTRSKSRSSSK 

SHSRRKRSQSKHRSRSHNRSRSRQKDRRRSK 

SPHRKRSKSRERRKSRSRSHSRDKRKDTREKI 

KEKERVKEKDREKEREREKEREKEKERGKN 

KDRDKEREKDREKDKEKDREREREKEHEKD 

RDKEKEKEQDKEKEREKDRSKBIDEKRJtKDK 

KSRTPPRSYNASRRSRSSSRERRRRRSRSSSRS 

PRTSKTIKRKSSRSPSPRSRNKKDKKREKHU) 

mSERRERERSTSMRKSSNDRDGKEKLEKNST 

S 


375 


1725 


A 


3192 


415 


101 


AHSSHQTRAILQEFQWDIIRHFPL\SPNLAI^G 
FVFPNLKKSLRGTHFSSVKKVTTLTWLNSQDP 
WF/FFYP*SPDLQIPSSFRNOLNDWYHHSQKC 
PDUXjAYVKK 


376 


1726 


A 


3199 


931 


418 


GV*WCDLGSPQPPPPGFKQFCLGRSSSWDYR 
HVPPHPANFVFLLETGFLHAGQAGLXGDPPAS 
ASQSAGITGVSHTWPKNHLIFYACLVIRSKRI 
K 


377 


1727 


A 


3201 


274 


1285 


KTGYTSRGSPLSPQSSIDSELSTSELEDDSISM 

GYKLQDLTDVQIMARLQEESLRQDYASTSAS 

VSRHSSSVSLSSGKKGTCSDQEYDQYSLEDffi 

EFDHLPPPQPRLRRCSPFQRGIPHSQTrFSSIREC 

RRSPSSQYFPSNNYQQQQYYSPQAQIPDQQP 

NRTNQDK/PPKKYA*PSPDAKYNCH»»QH\SSP 

VTVRNSQSFDSSLHGAGNGISRIQSCIPSPGQL 

QHRVHSVGHFPVSIRQPLKATAYVSPTVQGSS 

NMPLSNGLQLYSNTGIPTPNKAAASGIMGRS 

ALPRPSLAINGSNLPRSKIAQPVRSFLQPPJCFL 

SSLSTLRDGNWRDOCY 


378 


1728 


A 


3202 


112 


1789 


VFGVTESRPSVLRGDHLFALLSSETHQEDPir 

YKGFVHKWELDRVKLSFSMSLLSRFVGWG* 

PFKVNFYnTNRQPOmQHftALELTGRWlXW 

PMLFPWAPRDVPLLPSDVKLKLYDRSLESNP 

EQLQAMRHIVTOTIRPAPYIIFGPPOTQKTVT 

LVEAKQWKHLPKAHILACAPSNSOADLLC 

QRIJlVHlJPSSrrRIXAPSRDIRMVPEDDCPCCN 

WDAKKGEYVFPAKKKLQEYRVLmnLITAGR 

LVSAQFPIDHFTHIFIDEAGHCMEPESLVAIAG 

LMEVKETGDPGGQLVLACHJPRQLGFVLRSPL 

TQKHGLGYSLLERLLTYNSLYKKGPDOYDPQ 

FmCLLRNYRSHPTILDIPNQLYYEGELQACA 

DWDRERTCRWAGVLPRQGFPIIFHGVMGKD 

EREGNSPSFIWEEAATVTSYLKLLLAPSSKK 

GKARLSPRSVGYlSPYRKQVEKIRYCnKLDR 

EUIGLDDIKDLKVTCCSTVTPCLPCAPTCPLP 

ETSSSFHSSPRHRPTPAALNRARALPEPLTPGD 

SNLRVWDGIRKPACLTNTSCHS 


379 


1729 


A 


3206 


432 


130 


PKAAPSVXLWFPPFL»GSFKPTKGlfrXCVXIK 
*LSTRBAXDSXPGRQIAXXRQGGKVEnTAL 














XKQSNNKGTRASSYXBPDAXEQWKFPHKKL 
QLPGXTHE 


380 


1730 


A 


3207 


187 


507 


GOTOHPHPARPPLSOVGGCXJCSHSKPWTAGS 
PEQRDHPAPHKQIEAGQGLPGPQAWGG*KGP 
AXLLPGPGGGPGFVASLEARAQASSGVTPNG 
GGRTyPYPTFSSGE 


381 


1731 


A 


3225 


1 


840 


GTRPGHLPAPSDQFCV/HL*SIPSWGSF*QESL/ 

EMQLITSLGLQEFDIARNVLELIYAQTLVWIGI 

ITCPUJ>FlQMI\CLFIMFirSKNlSLMMNFQPPS 

KAWRASQMMTFFIFLLFn^FTGVLCrLArn 

WRIJCPSADCGPFRGLPLFIHSIYSWIDTLSTIIP 

GYLWVVWryittrtJGSVHFFFlLTLmJITYLY 

WQITEGRKIMIMXHEQIINEGKDKMFLIEKLI 

KLQDMEKKANPSSLVLERREVEQQGFLHLGE 

HDGSLDLRSRRSVQEGNPRA 


382 


1732 


A 


3238 


256 


38 


LLMKVSSTCFSCHLHHHHHHHHRHHQGHNS 
LFFSLKSSSNSSTLPVYLSYNIILVFSKCLVFDF 
LFSNACL 


383 


1733 


A 


3241 


1542 


343 


KGAPSFVRLYQYPNFAGPHAALANKSFFKAD 

KVTMLWNKKATAVLVIASTDVDKTGASYYG 

EQTLHYIATNGESAWQLPKNGPIYDWWNS 

SSTEFCAVYGFMPAKATIFNLKCDPVFDFGTG 

PRNAAYYSPHGHILVLAGFGNLILQ1*AD/IMK 

VWNVKNYKUSKPVASDSTYFAWCPDGEHIL 

TATCAPRLRVNNGYKIWHYTOSILHKYDVPS 

NAELWQVSWQPFLDGIFPAKTrTYQAVPSEVP 

NHEPKVATAYRPPALRNKPITNSKLHEEEPPQ 

NMKPQSGNDKPLSKTALKNQRKHEAKKAAK 

QEARSDKSPDLAPTPAPQSTPRNTVSQSISGDP 

EIDKKIKNLKKKLKAIEQLKEQAATGKQLEK 

NQLEKIQKETALLQELEDLELGI 


384 


1734 


A 


3242 


3 


678 


mSPAARSPGLETPTCLLFVlAAIAAVFVDSAIP 

KLTQHRPQDGSFPYTILDPPLYLPGQCAPPQP 

LSQCARRVHGEKLRRPTFGPRHRGAGTAICMS 

ASLVRATVRAVSKRKLQPTRAALTLTPSAVN 

KKQLLKDKPEHVOVKVOVRTRGCNGLSYTL 

BYTKIXGDSDEEVIQDGVRVFIEKKAQLTLL 

GTE^ffiYVEDKLSSEFVF^^NPNIKGTCGCGES 

FNI 


385 


1735 


A 


3243 


3190 


664 


VAMGTPRAQHPPPPQLLFLILLSCPWIQGLPL 

KEEEttJ'EPGSETPTVASEALAELLHGALLRR 

OPEMGYLPGPPLGPEGGEEETTnm-rnVlT 

TVTSPVLQtNNISEGEGYVESPDLGSPVSllTL 

GLLDCTYSIHVYPGYGIEIQVQTLNLSQEEELL 

VLAGGGSPGLAPRLLANSSMLGEGQVLRSPT 

NRLLLHFQSPRVPRGCKjFRIHYQAYLLSCGFP 

PRPAHGDVSVTDLHPOOTATFHCDSOYQLQO 

EETUCLNGTRPSWNGETPSCMASCGGTIHNA 

TLGRIVSPEPGGAVGPNLTCRWVIEAAEGRRL 

HLHFERVSLDEDNDRLMVRSGGSPLSPVIYDS 

DMDDVPERjGLISDAQSLYVELLSBTPANPLLL 

SLRFEAFEEDRCFAPFLAHGNVTTTDPEYRPG 

ALATTSCLPOYALEPPGPPKAIECVDPTEPHW 

NDTEPACXAMCGGELSEPAGWLSPDWPQS 

YSPGQDCVWGVHVQEEKRILLQVEILNVREG 

DMLTLFDGDGPSARVLAQLRCHKJPRRRLLSS 

GPDLTLQFQAPPGPPNPGLGQGFVLHFKEVPR 

NDTCPELPPPEWGWRTASHGDLIRGTVLTYQ 

CEPGYELLQSDILTajWDLSWSAAPPACQKI 














MTCADPGBMNGHRTASDAGFPVGSHVQYRC 

LPGYSLEGAAMLTCYSRDTGTPKWSDRVPKC 

ALKYEPCXNPOVPENQYQTLYKHHVQAGESL 

RITCYEGFELIGBVTTrcVPOHPSQWTSQPPLC 

KVTQTTOPSRQLJEGGNLALAnjJLGLVIVLG 

SGVYnrYTKLQGKSLFGFSGSHSYSPirVESDF 

SNPLYEAGDTREYEVSI 


386 


1736 


A 


3250 


5725 


3984 


GTSTVTMATKKHFSIDJ«XGMLLKKDNQDT 

RKLLNnWAlJBVA\rVTvIKKSETYAPLFCU>SF 

HKFCXGLLA0TLVEDVNICLQACSSLHALSSS 

LPDDIXQRCVDVCRVQLVHRGTCIRQAFGKL 

iJCSIPIX>VFLSNNNHTEIQElSLALRSHMSKAP 

SNTFHPQDFSDAaSFlLYGNSHRTGKDNWLE 

RLFYSCQRLDKRDQSnPRNLLKTDAVLWQW 

AIWEAAQFrVLSKLRTPLGRAQDTFQTIEGlIR 

SLAGHTLNPDQDVSQWTTADNDEGHGNNQL 

RLVLLLQYLENLEKLMYNAYEGCANALTSPP 

KVIRTFLYTNRQTCXJDWLTRIRLSIMRVGLLA 

GQPAVTVRHGFDLLXaiKTTSLSQGNELEVSI 

MMWEALCELHCPEAIQGIAVWSSSIVOKHL 

LWINSVAQQAEGRFEKASVBYQEHLCAMTG 

VDCaSSFDKSVLTLASAGCKSASLKHCLNGE 

bRKiVLSKPTDSSPEVINYLGNKACECYISTA 

DWAAVQEWQNAIHDLKKSTSSTSLNLKADF 

NYKSLSSFESGKFVECTEQLELLPGENINLLA 


387 


1737 


A 


3255 


380 


76 


MDIFLYNCKYQVQTEr*NSIQHIMA\SKKLSRF 
LJCYVHNL*AENYKTLMK*INEDLNKQRDVPY 
S»TARl>ira^IPTKTIFRFKAIYIKIPATYFIET 


388 


1738 


A 


3260 


685 


428 


PQWLOLQVYALPPANFVKFVEMRSTILAQTO 
FELLDSSDLPASASKSAGrrCMSHHARTLSLK 
*WPFCLSATQEKFC*PASEGVAW 


389 


1739 






1 
1 




1 T^^^V LJ'I'UTVlilT "KTDTTDT tsA AT fcTTOFVrtTV^TT* T 1*1 

LJLKj 1 rl 1 r X 1 MI^NKJUKLr AAL.'lloDQTGHALTI 
LTRLETQMINADYQNKLTLDYLLITDREVYE 
PFNLTNYCIJnHNQRLGAYDLG*V*Q/KLA)HV 
PVQV*H0FDPEA^4FR 


390 


1740 


A 


3270 


2 


372 


GRCHDQNKGKS\DGPDAQAEACGGESTYQEL 

SPRGVKKVHKFVNKGEKGIMVLAGDrLGIGV 
Y(XLPCMC*DRKLTYAHIPSTroLGAGAGY 


391 


1741 


A 


3273 


1 


187 


WQEMLDIMKAISDMMGKCTYPVLKEDAPRQ 

M3SKESKGK»FFQEMLDIMKAISr»lMGKCrY 

PVLKEDAPRQHVETFFQVGINQKSRGHEVRR 

KFPDVCHAPR 


392 


1742 


A 


3281 


901 


521 


FFFGDGVSPCRQAOV»WHDLDSLQNLPPGFK 

FTMLARMVSIS'PRDU'ALASQSAGITGVSHH 
APPQMDFITALLCTALKGCXPRQKEGGTLNLI 


393 


1743 


A 


3283 


385 


3 


RNRSVVPEFVIXGLSAGPQTQTLLFVLFVVIC 
LLTVMONIXLLVVmADSClJffrPMYFFLGQL 
SFLDLCHSSVTAPKLLENLLSEKKTISVEGCM 
A»VFFVFATGGTESSLLAVMAYDRYVAIRTR 
G 


394 


1744 


A 


3284 


575 


1054 


CTKCaCAIXmaFNKNFC'rKtKSGFYLHLGKC 
LDNCPEQLEANNHTMECVSIVHCEVSEWNP 
WSPCTKKGKTCGFKRGTETRVREnQHPSAKG 
NLCPPTNETRKCTVQRKKCQKGERGKKGRE 














RKRKKPNKGESKEAIPDSKSLESSKEIPEQREN 
KQQQ 


395 


1745 


A 


3286 


1 


340 


RVLYVPSMGFCILVAHGWQKISTKSVFKKLS 
WICLSMVILTHSLKTFHRNWDWESEYTLFMS 
ALKVNKNNAKLWNNVGHALENEKNFERAL 
KYFLQATHVQPDDIGAHMNVGR 


396 


1746 


A 


3293 


1 


172 


GFRAWMTVKTEAAKGTLTYSRMRGMVAIL 
lAFMKQRRMGLNDFIQKIANNSYACKQ 


397 


1747 


A 


3295 


12 


401 


AEPACGASSCTPPSLRSSSSQSVGPLBPGRPL 

WSEACAFL»AAAPQGPASPCCGLPSGFPRVW 

AQCCPPGGALRFPEGLGSVLSPRRCaKJVSRGS 

GLSAVPQEVPSGFLGPGLRACPQEAPSRFLRA 

GLT 


398 


1748 


A 


3300 


1912 


2768 


KQRRWQNIQRKGPKRYIVIAGNSQSHQPMIFS 

MLRKLPKVTCRDVLPEIRAICIEEIGCWMQSy 

STSFLTOSYLKYIGWTLHDKHREVRVKCVKA 

LKGLYGNRDLTARLELFTGRFFa)WMVSMIV 

DREYSVAVEAVRLLILEJCNMEOVLMDVDCE 

SVYPIV*ASN*GLASAVGEFLYWKLFYPECEI 

RTMGGREQRQSPGAQRTFFQLLLSFFVESKSH 

SVTQAGVQWQFSAHRDLOJPGSSNSHVSASR 

VAGIAGAHRHTWLIYVFFSWRQGFAVLAGL 

VSNS 


399 


1749 


A 


3301 


536 


2391 


LRSYGCKAPSRISHLHK\FLFLLLPSLLMGYSE 

SPPPITDSWAPFISLTHHVLSQSQSPLSSNCWI 

CLSTHTQ»FTALPADLLTWTQSNVSLHISYLA1 

PFLADSFLKPV/L*PGNSAKHLSFKLSSLSMVS 

GRAVALLHUASGLTSIQTNTASSKPPJWGYU. 

STQTSFISPPPLCLSRTYPNPAHATMVGQVPQ 

SLCGLIFTL/R'n'CRPSILHPNYKlISTSAWQKV 

LCFSGSPnHTSLHLTTGSSFLSFHPIPGFPAAN 

SALYVSSlJCGPPGKNVTIPSPVTGT»QPraRGS 

N/RLTVDKDNFFLSPKPNSLHQLPSQ\TPYQAL 

TGAAI^OSYPIWENENTLSWLPITTYNFCLST 

PSLFFLCDTN*YLCLPANWSGTCrLVFQAPTI 

NHJPNQTILISVEASISSSPIRNKWALHLITLLT 

GLGITAALGTGIAGirrSrrSYQTLFTTLSNTVE 

dmhtsrrslqrqldflvgvilqnwrvujllt 
tekggtciylqeeccfcvnesgivhiavrrlh 
draah.»hqvadswwqgssllrwipwvapf 
lgpuflflllmigpcifnlvsrfisqrijfcnq 
asmqkhidnifhlchv*yqslrgnhseApepr 

P 


400 


1750 


A 


3303 


2 


453 


THWRHSSGVPGSTTARRRRRH£IATSDNQE 

YYfJRIXQEVTNRHlNDQKMLADLDDLNRTK 

KYLEERLIELLRDKDALWQKSDALEFQQKLS 

AEERWLGDTEANHCLDCKREFSWMVRRHHC 

RICORIFCYYCCNNYVLSKHGGKKERCX: 


401 


1751 


A 


3304 


1 


626 


MAPQHSSLDDKVPQQASTVCFEFQDILQHSQ 

CTEHKDSLWGPGARSQPFGAHNTRLSPDSCP 

EKIVLRALKDSRAGMPEQDKDPGVQENPDD 

QRRVPQGTODAPSAFRPLWDNOGLSPFVSRP 

GPLERDLHAQRSEVTYNQRSQSSWMSSFPKR 

NAFVSPYSSMGQAQP/GLPKTNPIGESCCWEG 

LSLSTQILG*QKPSKYIPSLCKR 


402 


1752 


A 


3305 


1678 


172 


MELPSGPGRERLFDSHRLPGDO^-LLVLLLYA 
PVGFCLLVLRLFLGIHVFLVSCALPDSVLRRF 
WRTMCAVLOLVARQEDSOLRDHSVRVLISN 
HVTPFDHNIVNLLTTCSTVSESEAESATaRFP 














GAQIXAH.SPLAFRMEDTEALPLTPILYPTCQ 

FFFFUFLNIFLLAFSSPGSQPLLNSPPSFVCWSR 

GFMHvINGRGELVESLKRFCASrrRLPPTPLLLF 

PEEEATNGREGIXRFSSWPFSIQDWQPLTLQ 

VQRTLVSVTVSDASWVSELLVWSLFVPFTVY 

QVRWLRPVHRQUJEANEEFALRVQQVLVAKE 

LG\QTGTRLTPAVDKAEHMKRQRHPR\LRPQS 

AQSSFPPSPWVLSS/SDVQTGQTLGFREFKESF 

CmVAIOVFIPERPWPKTGCCKTLTIHLILL+G 

GPVSFSCre\DIHPRGT*VPTQQASGLPSFPSYG 

PARGGVL*HPSAQQPLTFAVKSSVWARAGRAL 

QERKQVALYEYARRRFTERRAPGGLD 


403 


1753 


A 


3307 


44 


447 


DPSPSLLAVALGLRAGERTRSGPGSSSPSGGIS 

GGASAGLASSPECACGRSHFTCAVSALGECT 

CIPAQWQCDGDNDCGDHSDEDGCILPTCSPL 

DFHCDNGKCIRRSWVCDSDNDCEDDSDEQD 

CPPRECHED 


404 


1754 


A 


3311 


409 


1 


PRHGWGRRVLGRDRPRLQKVKKSVKAIYIPG 
QDHVQNBEIYARVLDKFGSNFLSRDNADLGT 
AFVKFSTLTK.^LSALLKNLLQGLSRNVIFTLDS 
LLKGDLKGVKGDLKKPFDKAWKDYETKFAK 
lEKEKRERJEWR 


405 


1755 


A 


3322 


12 


458 


AAVPVENPWDDPRYRPRVRll'-lWKDClAGQA 

KVLCNDSYGVTIDWSPKGAFIRLTSQSVGNG 

HPASKEbTOQMVDTDCNTTKVFirWTYGDMVE 

PIU^MIRPAVGAKHKELWKILMALKKIKaWE 

GKYTKPSQYNPNYMLELAHNDSVW 


406 


1756 


A 


3324 


1 


426 


LSMLSTISTEHRLSVLWPIWYCCHCPTHLSAV 
MCVLLWALSLLQSILEWMFCSFLFSDVDSDN 
WCQILDFLTAVWLIFLlVLVLCGFrLVLLVRnC 
GSQKMPLTRLYVTILLTGLVFLFCSU1.S1Q*F 
LLYWIEKDLDDL 


407 


1757 


A 


3328 


213 


1841 


SGDLSPAELMMLTIGDVIKQLIEAHEQGKDID 

LNKVKTKTAAKYGLSAQPRLVDIIAAVPPQY 

RKVLMPKLKAKPlRTASGIAVVAVMCKraRC 

PmSFTGNlCVYCPOOPDSDFEYSTQSYTOYEP 

TSMRAIRARYDPFLQTRHRIEQLKQLGHSVD 

KVEFIYMGGTFMALPEEYRDYFIRNLHDALS 

GHTSNNIYEAVKYSERSLTKCIGmETRPDYC 

MKRHLSDMLTYGCTRLEIGVQSVYBDVARD 

ThWGHTVKAVCESFHLAKDSGFKVVAHMMP 

DLPNVGLERDIEQFTEFFENPAFRPDOLKLYP 

TLVIROTGLYELWKSGRYKSYSPSDLVELVA 

RILALVPPWTRVYRVQRDIPMPLVSSGVEHG 

NLRELALARMKDLGIQCRDVRTOEVGIQHH 

HKVRPYQVELVRRDYVANGGWEnJLSYEDP 

DQDILIGLLRLRKCSEETFRFELGGGVSIVKEL 

HVYOSWPVSSRDPTKFQHQGFGMLLMEEA 

ERIAREEHGSGKlAVISGVamNYYRKIGYRL 

QGPYMVKMLK 


408 


1758 


A 


3335 


3 


467 


AIASPRAAGIRHELTSTMAAGKNKRLTKGGK. 
IvuAKJUwvV/LIIMillSiLrJvii^V JHlvKlKlAblXj 
LKORVFEESLADLQND\TDOYLLRVI*VAFTT 
BRTNQI/REVFNKLIPDSIGKDIEKACQSIYPLH 
DDFARKVmuaCPKFELRKLMELHGEGSS 


409 


1759 


A 


3338 


7 


1252 


PRWRNSARDEn-LSFPQNYYIQWLNGSUHGI, 
WhOASLfSNLCLFVLMPFAFFFLESEGFAOLK 
KGIRARILETLGMLLLLALLILGIVWVASALID 
NDAASMESLYDLWEFYLPYLYSCISLMOCLL 














IXLCTPVGL\SRMFTVMGQLLVKPTILEDLDE 

QELENVKTLKTKLERRKKASAWERNLVYPA 
VMVLLLIETSISVLLVACNILCLLVDETAMPK 
OTRGPGIGNASLSTFGFVGAALEIILIFYLMVS 
SWGFYSLRITGNFTPKKDDTTMTKnGNCVS 
ILVLSSALPVMSRTLGITRFDLLGDFGRFNWL 
GNrnVLSYNUJAlVTrLCLVRKFTSAVREE 
LFKALGLHKLHLPNTSRDSETAKPSVNGHQK 
AL 


410 


1760 


A 


3339 


127 


1433 


GSHRFSLASPLDPEVGPYCDTPTMRTLFNLL 

WLAI^CSPVHTTLSKSDAKKAASKTLLEKSQ 

FSDKPVQDRGLWTDLKAESWLEHRSYCSA 

KAKDRHFAGDVLGYVTPWNSHGYDVTKVFG 

SKFTQISPVWLQLKRRGRHvIFEVTGLHDVDQ 

GWMRAVRKHAKOL\P*CLGSCLRTOLTMISG/ 

YvLDSmJEIEELSKrrVVQVAKNQHFDGFVVE 

VWNQLLSQKRVGLIHMLTHLAEALHQARLL 

AliVnTAnPGTIXJLGMFrHKEFEQLAPVLD 

GFSLMTYDYSTAHQPGPNAPLSWVRACVQV 

LDPKSKWRSKILLGLNFYavlDYATSKDAREP 

WGARYIQTLKDHRPRMVWDSQVSEHFFEY 

KKSMORHVVFYPTLKSLQVRLELARELGVG 

VSIWELGQGLDYFYDLL»VGIAASAVDVPFSK 

PWSE 


411 


1761 


A 


3342 


74 


2701 


VATRKLAKGFTQFAKMTEGTKKTSKKFKFFK 

FKGFGSFSNLPRSFTLRRSSASISRQSHLEPDTF 

EATQDDMVTVPKSPPAYARSSDMYSHMGTM 

PRPSKKAQNSQAARQAQEAGPKPNLVPGGV 

PDPPGLEAAKEVMVKATGPLEDTPAMEPNPS 

AVEVDPXRKPEVPTGDVEEERJPPRDVHSERAA 

GH>EAGSDYVKFSKEKYILDSSPEiaHKELEE 

ELKLSSTDLRSHAWYHGRIPREVSETLVQRN 

GDFURDSLTSLGDYVLTCRWRNQAUiFKlN 

KVWKAGESYTHIQYLFEQESFDHVPALVRY 

HVGSRKAVSEQSGAHYCPVNRTFPLRYLEAS 

YOLGQGSSKPASPVSPSGPKGSHMKRRSVTM 

TIXaXADKVTRSDGCPTSTSLPRPRDSIRSCA 

LSMDQIPDLHSPMSPISESPSSPAYSTVTRVHA 

APAAPSATALPASPVARRSSEPQLCPQSAPKT 

HCffiSDKOPHTSPSmXGKASPSPSLSSYSDPDS 

GHYCQLQPPVRGSREWAATETSSQQARSYGE 

RLKELSENGAPEGDWGKTFTVPrVEVTSSFNP 

ATFQSLLIPRDNRPL£VGL1JRKVKELLABVDA 

DTT AllUVTifA/TVT VADTT /T,\/TVl?fc/frvn Kif/^V 
UlLfAKrlV lIwVJLIV«L*VAKllfVJV 1 AiiJVll^iUyiOV 

RWGMELLTLPHGVRKLRLDLLERFHTMSIML 

AVDILGCTOSAffiRAALLHKTIQLAAELRGT 

MGNMFSFAAVMGALDMAQISRLEQTWVTLR 

QRHTEGABLYEKKLKPFUCSLNEGKEGEPLSN 

TTFPHVLPLrrLLECDSAPPEGPEPWGSTEHGV 

EWLAHLEAARTVAHHGGLYHTNAEVKLQG 

FQARPELLEVFSTEFQMRLLWGSQOASSSQA 

RRYEKFDKVLTALSHKLEPAVRSSHL 


412 


1762 


A 


3347 


1 


898 


IDRAAECRTKPLPMAVSIRGNADSIVACLVLM 

VLYLIKKRLVACAAVFYGFAVHMICIYPETYI 

LPrrLHLLPDRDNDKSLRQFRYTFQACL'ELL 

KRLCNRTALMFVAVAGLTFFALSFGFYYEYG 

WEFLEHTYFYHLTRRDIRHNFSPYFYMLYLT 

AESKWSFSLGIAAFLPQLILLSAVSFAYYRDL 

WCWFUTTSIFVTFhlKVCTSQYFLWYLCLLPL 














VMPLVRMPWKRAWLLMLWHGQAMWLAP 
AYVLEFQGKKITLFIWLAGLFFLLINCSIUQII 
SHYKEEPLTERnCYD 


413 


1763 


A 


3361 


3 


474 


PIPVRWNSLEGRLLRGYEQHANDGKDYISRN 

*DLRSWTAADMAAQnXRKWEAEEFAEQIKA 

YLEGTCVER/LRTHLENGKETLQLTEQSSQPTI 

PIVGIVAGLVLLGAVVTGAWSAVMCRKKNS 

GHFLPTDRVSYSEAASSDHAQGSDVSLTACK 

V 


414 


1764 


A 


3363 


1488 


453 


HQDUEUaCKILKTYNPDYDEDLVQEASSEDVL 

GVHMVDKDTERDIEMKRQLRKUIELHLYST 

AVKKYQEAMKTSLGVPQRERDEGSLGKPLCP 

PEILSETLPGSVKKRVCKPSEDHLEEFIAEHLP 

EASNQSLLTVAHADAOTQTNODLEDLEEHGP 

GQTVSEEATEVHMMEGDFDTLAELLIRDVLQ 

ELSSYNGEEEVDPEEVKTSLGVPQRGDLEDLE 

EHVPGQTVSEEATGVHMMQVDPATLAKSDL 

EDLEEHVPEQTVSEEATGVHMMQVDPATLA 

KQLEDSnTGSHQQMSASPSSAPAEEATEICIK 

VEEEVKTRKPKKKTRKPSKKSRWNVLKCWD 

IFNIF 


415 


1765 


A 


3369 


431 


315 


IPWSWVGRLSVRKMSILF*LTYNYNAILNKTP 
PSFSPSL 


416 


1766 


A 


3373 


42 


651 


RQEKMGLGEIGASGVLRSMLKERKKQNMKG 
NGNVTLTPLLPAVQCGCHLQPAGRSPLPSSHS 
APGLCSPLHPLQPQQEASTCPSGTLQGREKAA 
PGQGRPLCSLWAGGAGAVPGERGAEGRGPSD 
QAPDPKSGPWLFmjLGAPAEVRLHNVPHNL 
RRPPLP*ARGK*PPNSGCPWSEGRAKQPLSCG 
PKPQCSLPSQVPOJTH 


417 


1767 


A 


3382 


2 


2061 


EAQDPRACOPDAGGRFAARDAPGNSLRPPPS 

SPP/GWPGQLRLIPRVPGSELRCGKPERGRLP 

ASITCKIRGWPPGISKRPGLGGRSFPPGFAPKr 

WRPEARGPSVQSLPPIFSPQSAQTTAR»RPGAP 

KNAGRCGGAVRGPRLSLGPPPGPPPAPALPAR 

ASAGAGAAAAALAVGGVRGAGGARGTGGY 

GHCSC«/PT0RTC»OPQQPOPPMPARPR*AS\S 

TRGSRRGPGSRPARAAAAPRAGDHGRRPVRV 

HLRQHTAV»EFRLGDATAPPGGAAGPGAPAP 

R\GPGWDCALU>SPGPRSPRAVGCAEPEIWDP 

SPRRGTSPVPSVRSLRSEPANPRLGLPALLNSY 

PLKGPGLPPPWGPRTQTGHVinVQPSGSCIEH 

SKSUD/RGPWOAPPWQPSSSGLCSPKLATAGP 

PQSWQLCQIGRRRGLGGPGLKRGET/GLL*GC 

SMDHANRTKGPGVPTSNRCFSHIPG\GDGCSD 

HSSCEGHPDLHAGREMPAAPGLSELERVRFT 

VGCGGLASGISSASVSGLSPNRAGGPGQGDW 

EMYPVSWQTQESGGQG/SPKTGR'VGMLQA 

GAGSLQGGTGDaVWGLWEDGP/RG*DSPLPS 

GTGTEP*TPTTSIPFFPQPSGVYPSRATLLPMPS 

Y*ALGPSANKSEKPLLSFLYRGLCCRISLQLA 

KCTGQLSEffLLNVBrAFWSMWVTYFRK 


418 


1768 


A 


3398 


304 


2121 


EEEEHEEDEDDDDNNEEEEFECYPPGMKVQV 
RYGRGKNQKMYEASIKDSDVEGGEVLYLVH 
YCGWNVRYDEWKADKIVRPADKNVPKIKH 
RKKIKNKLDKEKDKDEKYSPKNCKPPALGPN 
PPFQTNPISWKWYPKLDLTDAKNSDTAHnCSI 
ETTSILNGLQASESSAEDSEQEDERGAQDMDN 
NGKEESKIDHLTNNRNDLISKHEQNSSSLLEE 














NKVHADLVISKPVSKSPERLRKDIEVLSEDTD 

YEEDEVTKKRKDVKKDTTDKSSKPQIKRGKR 

RYCNTEECLKTGSPOKKEEKAKNKESLCMEN 

SSNSSSDEDEEETKAKMTPTKKYNG1£EKRK 

SLRTTGFYSGFSEVAHCRnaXNNSDERLQNS 

XAKUKKLiV WbslQCjQWrKKTLKELrSDSDTE 

AAASPPHPAPEEGVAEESLQTVAEEESCSPSV 

ELEKPPPVNVDSKPIEEFCrVEVNDRKAEFPSS 

uoNrbA IrLJ'YLjlLNKLHQSL'QKOSRQQSS 

VTVSEPLAPNQEEVRSDtSETDSTIEVDSVAGE 

LQD1,QSERE»LASRF*CQCELKQ**SARTRTS* 

KSLYRSEKSERCSGRRKFIKKAEKKP*SNSGK 


419 


1769 


A 


3399 


206 


463 


QRECLSmiGQAOIQIGDACWELYCLEHGIQP 
NGWLDTQQDQLENAKMEHTNASFDTFFCE 
TRAGKHVPRALFVDLEPTVIDGIR 


420 


1770 


A 


3408 


1010 


685 


RRLSFFF*IWSSVLVTQARVQWRDLGSPQPLP 
PGFKRFSCXSLPSSWDVRHPSPRPVNF/HVFLV 
VMGFHHVGQAGLELLTSGDLPALASQSARTT 
GVNHCAQPROHFH 


421 


1771 


A 


3409 


355 


1326 


ADSNLIESCWQELGLGPW6GDWRVEQVQAS 

ASLRFPREVCSIRFLFTAVSLLSLFLSAFWLGL 

LYLVSPLENEPKEMLTLSEYHERVRSQGQQL 

QQLQAELDKLHKEVSTVRAANSERVAKLVF 

QRLNEDFVRKPDYALSSVGASIDLQKTSHDY 

ADRNTAYFWNKFSFWNYARPPTVILEPHVFP 

GNCWAFEGDQGQWIQLPGRVQLSDITLQHP 

PPSVEHTGGANSAPRDFAVFFLLSFFraQGLQ 

V YJJliniV&HjKj' IrUViiKiEIQTFHLQNDPPA 

AFPKVKIQILSNWGHPRFTCLYRVRAHGVRT 

SEGAEGSAQGPH 


422 


1772 


A 


3412 


2 


421 


EFDAQPSIGALVVFKRP*ATTGSDPGPKRC»1N 
YLVSCSMRSPESGKGEPGTARDYTPMGRPPP 
PVPSVSPGPLPGSLAIAPHSPEPHPWEQQPPRG 
QARSPPGGWLGSAT/RVRRPHNHP/RGH/HSP 
vul AOArAbrvjrl/VCli 


423 


1773 


A 


3420 


91 


706 


DAQRAIYSSVQPAVSLRQRQQDGAVKESGR/ 
RGGVRSFSRAAAAMAPIKVGDAIPAVEVFEG 
EPGm:VNlJ\ELJFKGKKGVLFGVPGAFrPGCS 
KTHLPGFVEQAEALKAKGVQWACLSVNDA 
FVTGEWGRAHKAEGKVRLLADPTGAFGKET 
DLLLDDSLVSIFGNRRLKRFSMWQDOIVKA 
LNVEPDGTGLTCSLAFNnSQL 


424 


1774 


A 


3421 


4 


7688 


RQVTRVGTRVLGSTTAAVFLSVEDDNDNAPQ 

FSEKRYWQVREDVTPGAPVLRVTASDRDKG 

SNAWHYSIMSGNARGQFYLDAQTOALDVV 

SP1J5YETTKEYTLRVRAQDGGRPPLSNVSOL 

VTVQVLDINDNAPIFVSTPFQATVLESVPLOY 

LVLHVQAIDADAGDNARLEYRLAGVGHDFP 

FnNNGTGWISVAAELDREEYDFYSFGVEAR 

DHGTPALTASASVSVTALDVNDNNPTFTQPE 

YTVRLNEDAAVGTSWrVSAVDRDAHSVlTY 

QITSGNTRNRFSrrSQSOG(H,VSLALPLDYKLE 

RQYVLAVTASDGTRQDTAQIWNVTDANTH 

RPVFQSSHYTVNVNEDRPAGTTWLISATDE 

DTGENARTTYFMEDSlPQFRIDADTGAVTrQA 

ELDYEDQVSYTLAITARDNGIPQKSDTTYLEI 

LVNDVNDNAPQHLRDSYQGSVYEDVPPFTSV 

LQISATDRDSGLNGRVFYTFQOGDDGDGDFI 






WO 01/57188

PCT/US01/03800




VESTSGTVRTLRRLDRENVAQYVLRAYAVDK 

GMK>ARTPMEVTVTNa,DVNDNPPVFEQDEnD 

VFVEENSPIGLAVARVTATDPDEGTNAQIMy 

QIVE(»aPEVFQLDIFSGELTALVDLDYEDRPE 

YVLVIQATSAPLVSRATVHVRLLDRNDNPPV 

IXjNFEIIJvNNYVTNRSSSFPGGAIGRVPAHDP 

DISDSLTYSFERGNELSLVIXNASTGELKLSR 

ALDNNRPLEAIMSVLVSDGVHSVTAQCALRV 

TIITDHvILTHSITLRLEDMSPERFLSPLLGLFIQ 

AVAATLATPPDHVWFNVQRDTDAPGGHILN 

VSLSVGQPPGPGGGPPFLPSEDLQERLYLNRS 

tLTAISAQRVLPFDDNICLREPCENYMRCVSV 

LRFDSSAPFIASSSVUvRPIHPVGGLRCRCPPGF 

TCaJYCETEVDLCYSRPCGPHGRCRSREGaYT 

CLCRDOYTGEHCEVSARSORCTPGVCKNOGT 

CVNLLVGGFKCDCPSGDFEKPYCQVTTRSFP 

AHSFTITRGLRQRFHFTLALSFATKERDGLLL 

YNGRFNEKHDFVALEVIQEQVQLTFSAGEST 

nVSPFVPGGVSDGQWHTVQLKYYNKPLLG 

QTGLPQGPSEQKVAVVTVDGCDTGVALRFGS 

VLGNYSCAAXQGTQGGSKKSLDLTGPIXLGO 

VPDLPESFPVRMRQFVGOMCRNLQVDSRHIDM 

ADFIANNGTVPGCPAKKNVCDSKTCHNGGTC 

VNQWDAFSCECPLGFGGKSCAQEMANPQHF 

LGSSLVAWHGLSLPISQPWYLSLMFRra.QAD 

GVLLQArrRGRSTTTLQLREGHVMLSVEGTGL 

QASSLRLEPGRANDGDWHHAQLALQAIOGP 

GHAILSFDYOQQRAEGNLGPRLHOLHLSNrrV 

GGIPGPAGOVARGFRGCLQOVRVSDTPEGVN 

SLDPSHGESINVEQGCSLPDPCDSNPCPANSY 

CSNDWDSYSCSCDPGYYGDNCTNVCDLNPC 

EHQSVCTRKPSAPHGYTCECPPNYLGPYCET 

RIDQPCPRGWWGHPTCGPCNCDVSKGFDPDC 

NKTSOECHCKENmnRPPOSPTCLLCDCYPTG 

SLSRVCDPEDGQCPCXPGVIGRQCDRCDNPF 

AEVTTNGCEVNYDSCPRAJEAGIWWPRTRFG 

LPAAAPCPKGSFGTAVRHCDEHRGWLmOJ 

NCTSITFSELKGFAmiLQRNESGLDSGRSQQL 

ALLLRNATQHTAGYFGSDVKVAYQLATRLL 

AHESTQRGFGLSATQDVHTTENLLRVOSALL 

DTANKRHWEUQQTEGGTAWLLQHYEAYAS 

ALAQNMRHTYLSPFIJVIFNIVISWKLDKGN 

FAGAKLPRYEALRGEQPPDLETrVOLPESVFR 

ETPPWRPAGPGEAQEPBELARRQRRHPELSQ 

GEAVASVnYRTLAGLLPHNYDPDKRSLRVPK 

RPIINTPWSISYHDDEELLPRALDKPVTVQFR 

LLETEERTKPICVFWNHSILVSGTGGWSARGC 

EWFRNESHVSCQCNHMTSFAVLMDVSRRE 

NGEILPLKTLTYVALGVTLAALLLITFFLTLL 

RILRSNQHGIRRNLTAALGLAQLVFLLGINQA 

DUTACTVIAILLHFLYLCTFSWALLEALHLY 

RALTEVRDVNTGPMRFYYMLGWGVPAFITG 

LAVGLDPEGYGNPDFCWLSIYDTLIWSFAGP 

VAFAVSMSVFLYOAARASCAAQRQGFEKKG 

PVSGLQPSFAVLIXLSATWLLALLSVNSDTLL 

FHYLFATCNaQGPFIFLSYWLSKEVRKALK 

LACSRKPSPDPALTTKSTLTSSYNCPSPYADG 

RLYQPVYGDSAOSLHSTSRSGKSQPSYIPFLUl 

EESALNPG\QGPPGU3GIPGR/LCFLGRFKDQQ 

H\DS*TRDFDSDLSLEDDQSGSYASTHSSDSEE 

HHKKKKKHAAFPGEQGWDSLLGPGAERLPLHS 

TPKDGGPGPGKAPWPGDFGTTAKESSGNGAP 

EERLRENGDALSRBGSLGPLPGSSAQPHKGDL 

KKKCLPTISEKSSLLRLPLEQCTGSSRGSSASE 

GSRGGPPSRPPPRQSLQEQLNGVMPIAMSnCA 

GTVDEDSSGSEFLFFNFLH 


425 


1775 


A 


3429 


155 


1417 


GEPAVQSCDCGCTQRSCPWLLVAPGLLSSSSS 

RAASVREAEDAPLQPASmPVSQGSRGPEOSL 

GSAECLPGDPLGARRATRAHSPVPGPPPSLPA 

AGTAVKRGLQPG*GA/GATSTPGTGAATGGL 

CGPAWAAPSAVGPCCCCPSISTTPSQMRSARP 

SLGCLPSWAS\PGTEHPPGPQGPGPS*DLCSV» 

KREFQRGPWAGMVILHRISAADPARAPGPDS 

NLQSALQQPATGCSEPAAVYSPPIGLWGA**P 

BYG»PQHSLPG*TAPADR*P\AGIKDRVYSNSI 

YELLH^GQRAGTCVLEYATPLQTLFAMSQYS 

QAGFSREDRLEQAKLFC31TLEDILADAPESQN 

NCmiAYQEPADDSSFSLSQEVLRHLRQEBKE 

EVTVGSLKTSAVPSTSTMSQEPELUSGMEKP 

LFLRTDFS 


426 


1776 


A 


3431 


1662 


369 


AIWWLSWLQHDLLPTPTQVAIDFTASNGDPR 

SSQSLHCLSPRQPNHYLQALRAVGGICQDYD/ 

SVGESGAGGNRQGGLAQRIPQLFLLPSDKRFP 

AFGFGAR1PPNFEVG*MRGKEGDGGRVSQAE 

KAGPHCSRLALTG\SHDFAINFDPENPECEGK 

RGDFHLPRLPADTLHTGAQTPLPRAQLPVPST 

HPRPVFI\EISGVIASYRRCLPQIQLYGPTNVAP 

UNRVAEPAQREQSTGQATKYSVLLVLTDGV 

VSDMAETRTAIVRASRlPMSmVGVGNADFS 

DMRLLDGDDGPLRO'RGVPAARDrVQFVPFR 

DFKDVSPPGPFRLKDSSASHPPKSDLRLPPFD 

VIXRTREPSWPP«SPTSPSDDPASPTXPLTPNHI 

TVPTLXAAPSALAKCVLAEVPRQWEYYASQ 

OISPGAPRPCTLATTPSPSP 


427 


1777 


A 


3446


79 


9748 


GCQSCWPAWPRLRRRGPASAGARLGRKAPW 

GLPGRVQDGRPLRFCFYLRPRAPFIAPVLSGA 

ASRPEASGDCRAGRETAMATLEKIAIKAFESL 

KSFQQQQQQQQQQQQQQQQQQQQQQQPPPP 

PPPPPPPQLPQPPPQAQPLLPQPQPPPPPPPPPP 

GPAVAEEPLHRPKKELSATKKDRVNHCLTIC 

ENIVAQSVRNSPEFQKLLGIAMELFLLCSDDA 

ESDVRMVADECLNKVIKALNIDSNLPRLQLEL 

YKEKKNGAPRSLRAALWRFAELAHLVRPQK 

CRPYLVNLLPCLTRTSKRPEESVQETLAAAVP 

KIMASFQNFANDNEIKVLLKAFIANLKSSSPn 

RRTAAGSAVSICQHSRRTQYFYSWLLNVLLO 

LLVFVEDEHSTLULGVLLTLRYLVPLLQQQV 

KDTSLKGSFGVTRKEMEVSPSAEQLVQVYEL 

■n.HHTQHQDHNVVTGALELLQQLERTPPPEL 

LQTLTAVGGIGQLTAAKEESGGRSRSGSrVEU 

AGGGSSCSPVLSRKQKGKVLLGEEEALEDDS 

ESRSDVSSSALTASVKDEISGELAASSGVSTPG 

ciAurlUll 1 cA^i^KoVH 1 1.QAU0 VUjLASCDLTSS 

ATDGDEEDILSHSSSQVSAVPSDPAMDLNDG 

TQASSPISDSSQTTTEGPDSAVTPSDSSEIVLD 

GTDNQYLGLQIGQPQDEDEEATOILPDEASEA 

FRNSSMALQQAHLLKNMSHCRQPSDSSVDKF 

VLRDEATEPGDQENKPCRIKGDIGQSTDDDS 

APLVHCVRLLSASFLLTGGKNW-VPDRDVRV 

SVKALALSCVGAAVALHPESFFSFaYKVPLD 

ITEYPHBQYVSDILNYIDHGDPQVRGATAILC 

GTUCSILSRSREHVGD'WMGTIRTLTGNTFSL 

ADCffLLRKTLKDESSVTCKLACTAVRNCVM 

SLCSSSYSELGLQLIIDVLTLRNSSYWLVRTEL 

LETLAEIDFRLVSFLEAKAENLHRGAHHYTGL 

UCLQERVLNNWIHLLGDEDPRVRHVAAASL 

IRLVPKLFYKCDQGQADPWAVAEDQSSVYL 

KlI.MHElX}PPSHFSVSTlTRIYRGYNLLPSrrD 

VTMENNLSRVIAAVSHELrrSTrRALTFGCCE 

ALCIXSTAFPVCIWSLGWHCGVPPLSASDESR 

KSCTVGMATMILTLLSSAWFPLDLSAHQDAL 

ILAGNLLAASAPKSLRSSWASEEEANPAATK 

QEEVWPALGDRALVPMVEQLFSHLUCVINIC 

AHVLDDVAPGPAKAALPSLTNPPSLSPERRK 

GKEKEPOEQASVPLSPKKGSEASAASRQSDTTS 

OPVrrSKSSSLGSFYHLPSYLKLHDVLKATHA 

NYKVTLDLQNSTEKFGGFLRSALDVLSQILEL 

ATLQDIGKCVEEILGYLKSCFSREPMMATVC 

VQQLLKTLFGTNLASQFDGLSSNPSKSQGRA 

QRLGSSSVRPGLYHYCFMAPYTOFTQALADA 

SLRNMVQAEQENDTSGWFDVLQKVSTQLKT 

NLTSVTKNRADKNAIHNHIRLFERLVIKALKQ 

YTTTTCVQLQKQVLDLLA(^VQLRVNYCLL 

DSDQVFIGFVUCQFEYIEVGQFRESEAnPNIFF 

FLVLLSYERYHSKQnGIPKnQLCDGIMASGR 

KAVTHAIPALQPIVHDLFVLRGTNKADAGKE 

LETQKEVWSMLUUUQYHQVLEMFILVLQQ 

CHKENEDKWKRLSRQIADIILPMLAKQQMHl 

DSHEALGVLNTLFEILAPSSLRPVDMLLRSMF 

VTPNIMASVSTVQLWISGILAILRVUSQSTED 

IVLSRIQELSFSPYLISCTVINRLRDGDSTSTLE 

EHSEGKQKNLPEETFSRFLLQLVGILLEDIVT 

KQLKVEMSEQQHTFYCQELOmAfCLIHIFKS 

GMFRRirAAATRLFRSDGCGGSFYTLDSLNLR 

ARSMITTHPALVLLWCQILLLVNHTDYRWW 

AEVQQTPKRHSLSSTKIXSPQMSGEEEDSDLA 

AKLCBICNRErVRRGALILFCDYVCQNLHDSE 

HLTWLIVNHIQDIJSLSHEPPVQDEISAVHRNS 

AASGIJaQAlQSRCEhn.STPTMLKKTLQCLEGI 

HLSQSOAVLTLYVDRLLCTPFRVLARMVDEL 

ACRRVEMLLAANLQSSMAQLPMEELNRIQBY 

LQSSGLAQRHQRLYSLLDRFRLSTMQDSLSPS 

PPVSSHPLDGDGHVSLETVSPDKDWYVHLVK 

SQCWTRSDSALLEGAELVNRIPAEDMNAFM 

MNSEFNLSLLAPCLSLGMSEISGGQKSALFEA 

ABEVTLARVSGTVQQLPAVHHVFQPELPAEP 

AAYWSKLNDLFGDAALYQSLP1I.ARALAQY 

LVWSKLPSHLHLPPEKEKDIVKFWATLEAL 

SWHUHEQIPLSLDLQAGLDCCCLALQLPGL 

WSWSSTEFVTHACSLIYCVHFILEAVAVQPG 

EQLLSPERRTNTFKAISEEEEEVDFNTQNPKYI 

TAACEMVAEMVESLQSVLALGHKRNSOVPA 

FLTPLLRNinSLAIUJ'LVNSYTRVPPLVWKLG 

WSPKPGGDFOTAFPEIPVEFLQEKEVFKEFIYR 

INTLGWTSRTQEEETWATLLGVLVTQPLVME 

QEESPPEEDTERTQINVLAVQAITSLVLSAMr 

VPVAGNPAVSCLEQQPRNKPLKALDTRFGRK 

LSUROIVEQEIQAMVSKREMATHHLYQAWD 

PVPSLSPATTGALISHEKLLLQINPERELGSMS 

YKLGQVSIHSVWLGNSrrPLREEEWDEEEEEE 

ADAPAPSSPPTSPVNSRKHRAGVDIHSCSQEL 

LELYSRWILPSSSARRTPAIUSEVVRSLLWS 

DLFTERNQFELMYVTLTELRRVHPSEDEILAQ 

YLVPATCKAAAVLGMDKAVAEPVSRLLESTL 

RSSHLPSRVGALHGILYVLBCDLLDDTAKQU 

PVISDYLLSNLKGIAHCVNIHSQQHVLVMCAT 

AFYLIENYPLDVGPEFSASnQMCGVMLSGSE 

BSTPSUYHCALRGLERLLLSEQLSRLDAESLV 

MjOVUKVNVHSPHKAMAALGLMLTCMYTG 

kekvspgrtsdpnpaapdsesvrvamhlvsvl 
fdrkkgfpcearwarilpqflddffppqdim 
nkvigeflsnqqpypqfmatwykvfqtlhs 
tgqssmvrdwvmlslsnftqrapvamatws 

L>oL.rr VaAo lorWVAAUJ'HVISRMGKLEQVD 

vnlfclvatdfyrhqieeeldrrafqsvlev 
vaapgspyhrlltclrnvhkvttc 


428 


1778 


A 


3449 


3 


430 


nsrpspsaalvevllrsgstfphtvsggwaa 
wgpwsscsrdcelgfrvrkrtctnpeprngg 
lpcvgdaaeyqdcnpqacpvrgawscwts 
wspcsascqgghyqrtrsctspapspgedicl 

uLJI 1 1 QACrliG WS 


429 


1779 


A 


3464 


583 


3 


daldrrylerchpaaggwvgege»alcqkt/ 

rfsgvlepplpslkdggrfpawt*rscskslr 

aaftsqffpsrrsraspgsapvgngqnltcqhp 

cpgscdpqvlsaswm*vehrskfrppp*nsn 

ppes/rs»qggtvqtgqhssgreagswrargr 

nagrr+kgggkigtkqgavrarkecrgema 

sgetdse 


430 


1780 


A 


3473 


2802 


270 


frmriflhcpwnqqmwkiwnlletslescka 

hlsiqkllker\q\qlpvfkhrdsivetlkrhr 

vvwaget\osgkstqvphflledlllnbwe 

askcnivctqprrisavslanrvcdelgceng 

pggrnslcoyqirmesracestrllycttgv 

llrklqedgllsnvs/hmfivdev\her\svqs 

dflliilkeilqkrsdlhlimsatvdsekfst 

yfthcpilrisgrsypvbvfhlediieetgfvle 

kdseycqkfleeeeevtinvtskaggikkyqb 

yipvqtgahadlnpfyqkyssrtqhailymn 

PHKIhaj)LILELLAYLDKSPQFItMEGAVLIFL 
P(H.AfflQQLYDLLSNDRRFYSERYKVIALHSI 
LSTQDQAAAFTLPPPGVRKIVLATNIAFrom 

pdwfvidtgrtkenkyhessqmsslvetfvs 

kasalqrqgraorvrdgfcfrmytrerfeg 

fmdysvpeilrvpleelcxhimkcnlgspedf 

lskaudppqlqvisnamnllrkigacelnepk 

ltplgqhlaalpvnvkigkmlifgaifgcldp 

VATLAAVMTEKSPFTTPIGRKDBADLAKSAL 

AMAD^riMT TTVMAVI n\l/W AT>fWtin'Vi>cxsj 
ruvi/u^oiyXlLt 111 rii\ X Lf U W AJ\.r\£v^lSU\j Y Kocl 

TYOUO^FLNRTSLL'ILEDVKQELIKLVKAAGF 

sssttstswegnrasqtlsfqeiallkavlva 
glydnvgknytksvdvteklacrvetaqgk 
aqvhpssvnrdlqthgwllyqeiaryarvy 
lrettlitpfpvllfggdievqhrerllsidgw 

rypQAPVKUVIFKQLRVLlDSVLRKKLENPK 

mslendkilqiiteliktenn 


431 


1781 


A 


3474 


1 


441 


frpapghvqp*ggssaaagggllshprpcqq 

pctpapapsrprslgslgqrvpaalataaqel 

patlggdggkpaltageaalpglhrsgvpaa 

aarc*pct/srpt*stlsptqaawwcrpsrrq 

qrgeastggasorrcgscfqv 


432 


1782 


A 


3478 


416 


23 


QIJUU.TlJWICTy/YSS*nEIAWH**KNMQID 

QWFRRE$PEIDLCKYS*LSFDKEAKA]K/WKE 

CSUmWCmO^WM/LHVQKKRI'VQTLHPS 

QKLKXSKWIKDLNVECRrrKLLDQEYPGDLGY 

SRALNSGSR 


433 


1783 


A 


3504


1876 


552 


CLAPCSPQPEKNGMQPLLLLLPPLLYQQLLHS 

SLOAPGESTLLVRTSKLLVGLOLQLLVWLLL 

QTRSLLALQLHLTSSAPLLAAPTAVCSCSRCS 

APRSRCVARPAARTGLPTPAPASSPAPAASPA 

PAASPAPAESTAXPQPLILLPKP/PPAPGAPPPRP 

GAPPPRPAASPSPAASPAPPAASPVLTASPPLP 

AASPSPAASPAPPAASPVLTASPPLPAASPSPA 

ASPAPPAASPVLTASPPLPAASPALAASPVHT 

ASPFVHVASPPVHTASPPVHVASPPVHTASPP 

VHVASPPVHTASPHVHVASPPVHVASPPVHV 

ASPPVHTASPPVHVASPPVHTASPHVHVASPP 

VHTASPPVHVASPPVHVASPPVHVAYPPVHV 

ASPPVHVASPPVHVASPPVSCSGDSTSDCFPP 

QPGAVFPHSLAPSLGGWSHLVAALP 


434 


1784 


A 


3316 


142 


590 


GGVNRPRSETEQVKTPVLISSWDYRHPPPRPA 

SFFVFLVmJFNTAlJVRMVLISWPCDLFrSASQ 

SAGmJVRHHAVRLLYFEQESHSVTQAGWWQ 

WHNLGSLQPLSLEDRLSPGVLGCSALCRSGV 

RTKFGINMVTSRERGTTRLPKEG 


435 


1785 


A 


3529 


1 


3161 


MSLVRAALEALDELDLFOVKGGPQSVIHVLA 

DEVQHCQSILNSLLPRASTSKEVDASLLSWS 

FPAFAVEDSQLVELTKQEimCLQGRYGCCRF 

LRDGYKTPKEDPNRLYY/ENPAEIJKLFENIEC 

EWPLFWTYFILDGVFSGNAEQVQEYKEALEA 

VLKGKNGVPLLPELYSVPPDRVDEEYQTSPHT 

\T3RVPMGKLPHMWGQSLYILGSLMAEOFLA 

PGEEDPLNRRFSTVTKPDWVQVYPSLPHGCS 

SKSPSHQCrnSIRTTRiaTAPVSILAETEEIICnL 

KDKGIYVETIAEVYPIRVQPARILSHIYSSLEIF 

IJ'FLNSVSGC^n^lRMKLSGRPYRHMGVLGTSK 

LYDIRKTIFTFTPQFnX3QQFYIj\LI»IKMIVE 

MLRTDLSYLCSRWRMTGQPTITFPISHSMLDE 

DGTSLNSSILAALRKMQDGYFGGARVQTGKL 

SEFLTTSCCraLSFMDPGPEGKLYSEDYDDN 

YDYLESGNWMNDYDSTSHARCGDEVARYL 

DHLLAHTAPHPKLAPTSQKGGLDRFQAAVQT 

TCDLMSLVTTCAKELHVQNVHMYLPTKLPQA 

SRPSFNLLDSPHPRQENQVPSVRVBIHLPRDQ 

SGEVDFKALVLQLKETSSLQEQADILYMLYT 

MKGPDWNTELYNERSATVRELLIELYGKVG 

EIRHWGURyiSGILRKKVEALDEACTDlXSH 

QKHLTVGLPPEHlElCnSAPLPYEALTQLIDEA 

SEGDMSISILTQEIMVYLAMYMRTQPtHJ'AE 

MFRLRIGLDQVMATELAHSLRCSAEEATEGL 

MNLSPSANDCNLLHHILSGKEFGVERSVRPTD 

aN VarAloUltlijAVGATlCTERTGIMQLKSEIK 

QSPGTSMTPSSGSFPSAYDQQSSKDSRQGQW 

ORRERLDGALNRVPVGFYOKVWKVI OTf m 

OLSVEGFVLPSSTTREMTPOEDCFSVHVESVL 

NRVPQPEYR.QLLVEAIL\VLTMLADIEI\HSIGS 

IIAVEKIVHIANDLFLQEQKTLGADDTMLAKD 

PASGICTLLYDSAPSGRFGTMTYLSKAAATY 

VQEFLPHSICAMQ 


436 


1786 


A 


3546 


73 


393 


CP*LTWELLEVKKAEVLQDSLDGRYSTPSSCL" 
EQPDSCRPYGRSFYALEEKHVIFSLDVGETDN 
KGKGKTIRGI*TFKGRKGGTYQREHDANPLA 
PXSARSCWMRKG 


437 


1787 


A 


3554 


5137 



2939 


AVRAEPGLEELSSGLRAHSPSATTVCEPEAQG 

SASGCRYAAHPHWGLGOAAAAGGSWEPQPP 

RPVCEPAGRGKPHPPAAPRSPLLPGSRRRPHA 

AQPGARARTSPPPASARNMAARPAA1LAWSL 

LLLSSALLREGCRARFVAERDSEDDGEEPWF 

PESPLQSPTVLVAVLAKNAAHTLPHFLGCLER 

U)YPKSRMAIWAATDHNVDNTTEIFREWLK 

NVQRLYHYVEWRPMDEPESYPDEIGPKHWP 

TSRFAHVMKLRQAALRTAREKWSDYILFIDV 

DNFLTOPQTLNLLIAENKTIVAPMLESRGLYS 

NFWCGITPKGryKRTPDYWQlREWKRTGCFP 

VPMVHSTFUDLRKEASDKLTFYPPHQDYTW 

TTDDIIVFAFSSRQAGIQMYLCNREHYGYLPIP 

LKPHQTLQEDIENLIHVQIEAMIDRPPMEPSQ 

YVSWPKYPDKMGFDEIFMINLKRRKGQGGD 

RWLRTLYEQEIEVKIVEAVDGKALNTSQLKA 

LNIO^dLPGYRDPYSSRPLTRGEIGCFLSHYSV 

WKEVIDRELEKTLVIEDDVRFEHQFKKKLMK 

LMDNIDQAQLDWELIYIGRKRMQVKEPEKA 

VPNVANLVEADYSYWTLGYVISLEGAQKLV 

GANPFGKMLPVDEFLPVMYNKHPVAEYKEY 

YESRDLKAFSAEPLUYPTHYTGQPGYLSDTE 

TSnWDNETVATOWDRraAWKSRKQSRIYSN 

AKNTEALPPPTSLDTVPSRDEL 


438 


1788 


A 


3563 


130 


527 


DTNSSSLFCRVFCLFLRWSFTLVAQARVQ'C 

NLSSLQPLPPGFK*FSCLSPPRS*DYRRPPPRPA 

NFLYF**RQGFTVLGQAGLELLT/S/GDPPTSA 

SQJSAGITGVSHRAWPVHAlSrHISLVKTRPSLT 

TLG 


439 


1789 


A 


3565 


446 


1834 


LLQPAMRKSPGLSDCLWAWUXLSTLTGRSY 

GQPSLQDELKDNTTVFTRILDRIXDGYDNRL 

RPGLGERVTEVKTOlFVTSFGPVSDHDMEYn 

DVTTRQSWKDERLKFKGPMTVLRLNNLMAS 

KIWTPDTFFUNGKECSVAHNMTMPNKLLRITE 

lXJTIXYTMRLTVR\AECPMAFGRDFmMD\AH 

ACPLKFGSYAYTRAEWYEWTOEPARSVW 

AEDGSICLNQYDIXGQTVDSGIVQSSTGEYW 

*mHFHLKRKIGYFVlQTyLPCIMrVILSQVSF 

WIJ^RESWARTVFGVTTVLTMTTLSISARNSL 

PKVAYATAMDWFIAVCYAFVFSALIEFATVN 

YFTKRGYAWDOKSWPEKPKKVKDPLIKKN 

NTYAPTATSYTPNLARGDPGLATIAKSATIEP 

KEVKPETKPPEPKKTFNSVSKIDRLSRIAFPLL 

FGIFNLVYWATYLNREPQLKAPTPHQ 


440 


1790 


A 


3568 


1 


350 


STSSO'PAAAAAIMREIVHLQAGQCGNQIQAK 
FWEVISDEHGIDPTGTYHODSDLQLERINVYY 
NEATGEAPVPSPTALRGPRGPCLG*RPPVPAG 
GKYVPRAVLVDMEPGTMDSV 


441 


1791 


A 


3569 


2 


1751 


FVAVAGAVSGEPLVHWCTQQLRKTFGLDVS 

EEnQYVLSIESAEEIREYVTDLLQGNEGKKGQ 

FIEElJriCWQKNDQmJSDPLQQCFKKDEILDG 

QKSGDHLKRGRKKGKNRQEVPAFTEPDTTAE 

VKITFDLAKAQENSNSVKKKTKFVNLYTREG 

QDIOJVVLUKjRHPCDCLGQKHKLINNCLICG 

RIVCEQEGSGPCLFCGT1.VCTHEEQDILRGDS 

NVKSQKIiKKLMSGVENSGKVDISTKDLLPH 

QELRKSGLEKAIKHKDKLLEFDRTSIRRTQVI 

DDESDYFASDSNQWLSKLERETLQKREEELR 

SRLDETIQAIANGTLNQPLTKLDRSSEEPLGVL 

VNPNMyQSPPQWVDHTGAASQKKAFRSSGF 

GLHT^SFQHQLRIQDQEFQEGFDGGWCLSVH 

QPWASLLVRGKRVEGRSWYTPHRGRLWIAA 

TAKKPSPQEVSELQATYRLLRGKDVEFPNDY 

PSGCU.GCVDLIDCLSQKQFKEQFPDISQESDS 

PFVnCKNPQEMVVKFPIKGNPKIWKLDSKIH 

QOAKXGLMKQNKAV 


442 


1792 


A 


3576 


1 


2019 


MPRSHTGERLCEGKEOSQCAENFSPNLSVTK 

KTAOVKPYECTICXjKAFMRLSSLTRHMRSHT 

AIRAI\EKPYKCKEaGRAFSLSQILSK\HERSH 

TGEKPYKCKQCGKTFIYHQPFQKHERTHIGEK 

PYECXQCGKALSCSSSLRVHERIHTGEKPYEC 

KQCGKAFSCSSSIRVHERTHTGEKPYACK\EC 

GKAFISVTTSVLTHMrTHNGDRPYKCKECXJKA 

FIFPSFLRVHEKIHTGEKPYKCKQCGKAFRWS 

TSIQIHERIHTGEKPYKCKECGKSFSARPAFRV 

HVRVHTGEKPYKCKECOKAFSRISYFRIHERT 

HTGBKPYBCKKCGKTFNYPLDLKIHKRNHTG 

EKPYECKECAKTFlSLENFRRHMnHrGDGPY 

KCRDCGKVFIFPSALRTHERTHTGEKPYECKQ 

CGKAFSCSSYlRIHKRTHTGEK\FTfECKECGK 

AFIYPTSFQGHMRMHTGEKPYKCKECGKAFS 

T HQQRDX'DHTDTirKrVTTIfPT TJ^'♦/^V^^^^V A17C\7CTC! 

Lrtloorivuul 1 JKXnJN X cJsSr\JEf<J^ \j}\J<3is,Ar o V o 1 o 

LKKPMRNAQSDRKLY/KCEK^EKVFNSNRCF 

QSCENSH*REKSCQCK*YRKRDTR*FMYSQV 

PHNHVSVSNGPYR/CGSPIRLYNT*NISINRNL 

VAWTP*CST1FKCLWCWCKRAALSW*/IVQ 

DSGRGRWLH»VIPALWEAKAGGSRGQEIKTIL 

ANTVKPHLY 


443 


1793 


A 


3578 


287 


114 


DFYERKFEQFEGHKQIVNKWRDLLCSWKRK 
LSIIKKSVLQNNL*FSAASMRFQKVFF 


444 


1794 


A 


3582 


3335 


1909 


HLFFSLFLAAMAMTGSTPCSSMSNHTKERVT 

MTKVTLENFYSNLIAQHEEREMRQKKLEKV 

MEEEGLKDEEKRLRRSAHARKETEFLRLKRT 

RLGLEDFESLKVIGRGAFGEVRLVQKKDTGH 

VYAMKBLRKADMLEKEQVGHIRAERDILVEA 

DSLWVVKMFYSFQDKLNLYLIMEFLPGGDM 

MTLLMKKDTLTEEETQFYIAETVLAIDSIHQL 

KAHRTEFYRNLNHSLPSDFTFQNMNSKRKAE 
TWKRNRRQLAFSTVGTPDYIAPEVFMQTGYN 
laCDWWSLGVIMYEMIJOYPPFCSETPQETY 
KKVMNWKETLTFPPEVPISEKAKDUUIFCCE 
WEHRIGAPGVEEKSNSFFEGVDWEHIRERPA 
AISIEKSIDDTSNFDEFPESDILKPTVATSNHPE 
TDYKNKDWVFINYTYKRFEGLTARQAIPSYM 
KAAK 


445 


1795 


A 


3584 


1 


6169 


RTRGIEKRFAYSFLQQLIRYVDEAHQYILEFD 

GGSRGKGEHFPYEQEKFFAKWLPLIDQYFK 

NHRLYFLSAASRPLCSGGH/VSNKEKEMVTSL 

FOCUJVLVRHRISLFGNDATSIVNCLHILGQT 

LDARTVMKTGLESVKSALRAFLDNAAEDLE 

KTMENLKQGQFIHTRNQPKGVTQlINYTrVA 

LLPMLSSLFEHIGQHQFGEDLILEDVQVSCYIU 

LTSLYALGTSKSIYVERQRSALGECLAAFAGA 

FPVAFLETHLDKHNIYSIYNTKSSRERAALSLP 

TNVEDVCPNIPSLEFCLMEEIVELAESGIRYTQ 

MPHVMEVILPMLCSYMSRWAWEHGFENNPER 

AEMCCTALNSEHMNTLLGNXLKIIYNNLGIDE 

GAWMKRLAVFSQPnNKVKPQLLKTHFLPLM 

EKLKKKAATWSEEDHLKAEARGDMSEAEL 

ULDEFTTLARDLYAFYPUJRFGDYNRAJCWL 

KEPNPEAEELFRMVAEVFIYWSKSHNFKREE 

QNFWQNEDWMSFUTDTKSKMSKAAVSDQ 

ERKKMKRKGDRYSMQTSLrVAALKRLLPIGL 

MCAPGDQELIAI-AKNRFSLKDTEDEVRDnRS 

NIHLQGKLEDPAIRWQMALYKDLPNRTDDTS 

DPHCTVERVLDIANVLFHLEQKSKRVGRRHY 

CLVEHPQRSKKAVWHKLLSKQRKRAWACF 

IU4APLYNLPRHRAVNLFLQGYEKSWIETEEH 

YFEDKLIEDLAKPGAEPPEEDEGTKRVDPLHQ 

LILLFSRTALTEKCKLEEDFLYMAYADIMAKS 

CHDEEDDDGEEEVKSFBEKEMEKQKLLYQQ 

ARLHDRGAAEMVLQrnSASKGETGPMVAAT 

LKLGIAILNGGNSTVQQKMLDYLKEKKDVGF 

FQSLAGLMQSCSVLDLNAFERQNKAEGLGM 

VTEEGSGEKVLQDIMEFTCDLFRFLQLLCEGH 

NSDFQNYLRTQTGNNTTVNmSTVDYLLRVQ 

ESISDFYWYYSGKDVIDEQGQRNFSKAIQVA 

KQVFNTLTEYIQOPCTGNQQSLAHSRLWDAV 

VGFLHVFAHMQMKLSQDSSQIELLKELMDLQ 

KDMWMLLSMLEGNWNGTIGKQMVDMLV 

ESSNNVEMIIJCFFDMFLKLKDLTSSDTFKEYD 

PDGKGVIFKRDFHKAMESHKHYTQSETEFLL 

SCAETDENETLDYEEFVKRFH^PAiCDIGFNVA 

V1XT^^^H^1PNDTRLQTFLEIJ^ESVLNYFQP 

FLGRIEIMGSAKRIERVYFEISESSRTQ'WEKPQ 

VKESKRQF1FDVVNEGOEKEKMELFVNFC5ED 

TTFEMQLAAQISESDLNERSANKEESEKERPEE 

QGPRlklAFFSILTVRSALFALRYNILTLMRMLS 

UCSUCKQMKKVKKMTVKDMVTAFFSSYWSI 

FMTLLHFVASVFRGFFRUCSLLLGGSLVEGA 

KKDCVAELLANMPDPTQDEVRGDGEEOERKP 

LEAALPSEDLTDLKELTEESDLLSDIFOLDLKR 

EGGQYKIJPHNPNAGLSDLMSNPVPNIFEVQE 

KFQEQKAKEEEKEEKEETKSEPEKAEGEDGE 

KEEKAKEDKOKQKLRQLHTHRYGEPEVPESA 

FWKKUAYQQKLLNYFARNFYNMRMLALFV 

AFAINFILLFYKVSTSSWEGKELPTRSSSENA 

KVTSLDSSSHRIIAVHYVI^SSGYMHPTVRIL 

PILHTVISFFCnGYYCLICVPLVIFKREKEVARK 

LEFDGLYTTEQPSEDDIKGQWDRLVINTQSFP 

NNYWDKFVKRKVMDKYGEFYGRDRISELLG 

MDKAALDFSDAREKKKPKKDSSLSAVLNSID 

VKYQMWKLGWFTDNSFLYLAWYMTMSVL 

GHYVNNFFFAAHLLDIAMGFKTLRTILSSVTH 

NGKQLVLTVGLLAVWYLYTWAFNFFRKF 

YNKSEDGDTPDMKCDDMLTCYMFHMYVGV 

RAGGGIGDEIEDPAGDEYEIYRnFDITFFFFVI 

VBLLAnQGLIIDAFGELRDQQEQVKEDMETKC 

FICGIGNDYFOTVPHOFEIHTLQEHNLANYLF 

FLMYLINKDETEHTGQESYVWKMYQHICWE 

FFPAGDCFRKQYEDQLN 


446 


1796 


A 


3592 


1 


355 


AGLELLNSDDPPALASQSAGiTGVTRTPSLFF* 
DTVIXCCSGWSAVAPSRLTAALFS*AQAVCL 
SLPRSWDYRRW/PPHPANFOFCRDE/SLA/ML 
PRLVSNSWTQAnXPRPPKMLOuQV 


447 


1797 


A 


3598 


1202 


1070 


LFVGGGPICPEGASGFAPGPAPAPRVGYDAEV 

GR*V»GAAASQGA/GSLRPRPTGPGHPGAWL 

QVWGAAAVCAGPAM*/AVRAKRGPRAG*EP 

NSPWRSGVLAA\RAVGAGPWP*P»PGCS*ARG 

PSSRSAPGLASGPAAPLLQGVHSSAGPLLCYI 

NGTLALGLKP**AWGWGEWRPKG 


448 


1798 


A 


3604 


3115 


557 


FRRKGQGOPKDFOAGLKYNSRHEKVNGLEE 

GVEFLPVNNVKKVEKHGPGRWWLAAVUG 

UXVLLGIOFLVWHLQYRDVRVQKVFNGYM 

RFINENFVDAYENSNSTEFVSLASKVKDALKL 

LYSGVFFLGPYHKESAVTAFSEGSVIAYYWSE 

FSIPQHLVEEAERVMAEERWMLPPRARSLKS 

FWTSWAFFTDSKTVQRTQDNSCSFGLHAR 

GVELMRFTTPGFPDSPYPAHARCXJWALRGD 

ADSVLSLTFRSFDLASCDERGRHLVXTVYNTtt- 

SPMEPHA\LVQLCGTYPPSY^JLTFHS\S\QNVL 

IjnjmJTERRHPG\FEATFFQLPRMSSCGGRL 

RKAQGTFNSPYYPGHYPPNIDCTWNIEVPNN 

QHVKViaiCnnfLI.BPGVPAGTCPKDYVEING 

EKYCGERSQFVVTSNSNKrrVRFHSDQSYTDT 

(ffLAEYLSYDSSDPCPGQFTCRTGRCIRKELR 

CDGWADCTDHSDELNCSCDAGHQFTCKNKF 

CKPLFWVCDSLNDCGDNSDEQGCSCP\AQTF 

RCSNGKCLSKSQQCNGKDDCGDGSDEASCP 

KVNWTCrKHTYRCLNGLCLSKGNPECDGK 

EDCSDGSDEKDCDCGLRSFTROARWGGTD 

ADEGEWPWQVSLHALGQGHICQASLISPNV/L 

VSAAHCYIDDRGFRYSDPTQWTAFLGLHDQS 

QRSAPGVQERRLKRIISHPFFNDFTFDYDIAIX 

ELEKPAEYSSMVRPICLPDASHVFPAGKAIWV 

TGWGHTQYGGTGALILQKGEIRVINQrTCEN 

LLPQQITPRMMCVGFLSGGVDSCQGDSQGPL 

SSVEADGRIFQAGWSWGDGCAQRNKPGVY 

TRLPLFRDWIKENTGV 


449 


1799 


A 


3618 


2 


613 


FVSGSPWRMDGSTERLEARRPAGRLPWSSRQ 

EMTRRPSLMAGRQHGWSAQQSATVANPVPG 

ANPDIiPHFLGEPEDVYTVKNKPVLLVCKAV 

PATQIFFKCNGEWVRQVDHVIERSTOGSSOLP 

TMEVRINVSRQQVEKVFGLEEYWCQCVAWS 

SSGTITCSQKAYIRIAYLRKNFEQEPLAKEVSL 

EQGIVLPCRPPEGIPPAB 


450 


1800 


A 


3620 


1 


2676 


MEPSLGQGMDLTCPFGVSPACGAQASWSIFG 

ADAABVPGTRGHSQQEAAMPHIPEDEEPPGE 

PQAAQSPAGQQGPPTAGVSCSPTPnVLTGDA 

TSPEGETDKNLANRVHSPHKRLSHRHLKVST 

ASLTSVDPA(aniDLVNDQLPDISlSEEDKKKN 

LAIXBEAKLVSERFLTRRGRKSRSSPGDSPSA 

VSPNLSPSASPTSSRSNSLTVPTPPEODEADVS 

SPHPGEPNVPKGLADRKQNDQRKVSQGRLAP 

RPPPVEKSKElAIEQKENFDPLOYPBrTPKGLA 

PVTNSSGKMALNSPQPGPVESELGKQULKTG 

WEGSPLPRSPTQDAAGVGPPASQGRGPAGEP 

MGPEAGSKAELPPTVSRPPLLRGLSWDSGPEE 

PGPRLQKVLAKLPLAEEEKRFAGKAGGKLAK 

APGLKDFQIQVQPVRMQKLTKLREEHILMRN 

QNLVGLKLPDLSEAAEQEKGLPSELSPAIEEE 

ESKSGLDVMPNISDVLLRKLRVHRSLPGSAPP 

LTEKBVENVFVQLSSAFRNDSYTLESaUNQAE 

RERNLTEENTEKELENFKASrrSSASLWHHCE 

HRETYQKLLEDIAVLHRLAARLSSRAEWGA 

VRQEKRMSKATBVMMQYVENLKRTYEKDH 

AEL1V4EFKKLANQNSSRSCGPSEDGVIRTARS 

MSLTLGKNMPRFRVSVAWPKFNALNLPGQ 

TPSSSSIPSLPALSESPNGKGSUPVTSALPALLE 

NGKTNGDPDCE/^APALTLSCLEELSQETKA 

RMHEEAYSKGFQBGLKKTKELQDLKEffiEEQ 

KSESPEEPEEVffiTCEEEKDPRSSKLEELVHFL 

QVMYPKLCQHWQVIWMMAAVMLVLTWL 

GLYNSYNSCAEQADGPLGRSTCSAAQKDSW 

WSSGLQHEQPTCQ 


451 


1801 


A 


3623 


504 


198 


QUQHQTVHTGRKLYECKECGKAFNQGSTLI 
RHQRIHTGEKPYECKVCGKAFRVSSQLKQHQ 
RIHTGERPYQCKELKGRGAEMLAVLAVKEQ 
NRTPVNY(HC 


452 


1802 


A 


3628


2 


195 


MTCLHSAKAFHy*SSCSFSCEEGFALIGPEW 

QCTALGVWTAPAPVCIAVQCQHLEALNEOT 

MG*DYPFTAFAyGSSCXYECHTVYRVRGLD 

MLHSRGCYLWNGHFTT^EAISCEPLEKPCH^S 

V*CSFSCEEGFAUGPEWQCTALGVWTAPAP 

VCIAVQCXJHLEALNEGTMG 


453 


1803 


A 


3637 


662 


142 


IQAKGLGIWHVPNKSPMQHWR\KGSLLRYRT 
DTGFLQTLGHNLLGIYQKYPVKYGEGKCWT 
DNGPVIPWYDFGDAQKTASYYSPYGQREFT 
AOFVQFRVFNNERAANALCAGMRVTGCNTE 
HHaOGGGYFPEASPQQCGDFSGFDWSGYGT 
\HVGYSSSRErrE\AAVIXFYR 


454 


1804 


A 


3641 


1 


362 


TQVHPAMLGLDELGRSGCGHCTQADLRFGD 
AAGRDPGQDNDKNTAEPAFPPPPRVMAAAA 
ALRAPAQSSVTFEDVAVNFSLEEWSLLNEAQ 
GCLYHDVMLETLTUSSLGKVLILNCDLS 


455 


1805 


A 


3646 


2 


414 


AAAGRGASGALTGEGGGEQGRRVGLGSRAH 
SLLLGPTFNSCQVSSQPPRVAGLGLPLKHEPS 
RPQPPSPRGPRTVRAGVPGAHPQDTPCPEFVR 
PRKVPLVGEAPGLPPEBRSRGWRRDTPGLQE 
SRVRAPSYDDIT 


456 


1806 


A 


3656 


396 


8 


QrVSFNSYLTLYTKNNUCSlvaaJLNVNTEMIK 
LIilJKMHNU3*AKFFLN»IQKAIJKBKILIHW 
P/LKDC/SFCSCTOTIKKMKRQTIVWEQTFnHI 
SVKELVSRIYEAFLQFNKTVNRPVFDIKKEQK 
F 


457 


1807 


A 


3660 


14 


1961 


SEAKLQGPTGMDLWQLLLTLALAGSSDAFSG 

SEATAAILSRAPWSLQSVNPGLKTNSSKEPKF 

TKCRSPERETFSCHWTDEVHHGTKNLGPIQLF 

YIRRNTQEWTQEWKECPDYVSAGENSCYFN 

SSFTSIWIFyCIKLTSNGGTVDEKCFSVDEIVQ 

PDPPIALNWTLLNVSLTGIHADIQVRWEAPRN 

ADIQKGWMVLEYELQYKEVNETKWKMN4DP 

ILTTSVPVYSLKVDKEYEVRVRSKQRNSGNY 

GEFSEVLYVTLPQMSQFTCEEDFYFPWLLniF 

GIFGa-TVMLFVFLFSKQQRIKMLILPPVPVPKI 

KGIDPDLLKEGKLEEVNmAIHDSYKPEFHS 

DDSWVEFDELDIDEPDBKTEESDTDRLLSSDH 

EKLBDNLGVKDGDSGRTSCXirEPDILETDFNAH 

DMEOTSEVAQPQRLKGEADLLCLDQKNQNN 

SPYHDACa?ATQQPSVIQAEKNKPQPLPTEGAE 

STHQAAHIQLSNPSSLSNIDFYAQVSDITPAGS 

WLSPGQKNKAGMSQCDMHPHvIVSLCQENF 

LMDNAYFCEADAKKCIFVAPHIKVESHIQP\S 

LNQEDmTTESLTVTAAGSIAGTGEHVPGSEM 

PVPDYTSmiVQSPQGLILNATALPLPDKEFLS 
SCGYVSTDQLNKIMP 


458 


1808 


A 


3663 


154 


462 


TRAPASGRSGAGLALSANAPDSQGHPGATEG 
PAGSLAHASGSARGTWKVRGRGSHGWERTV 
GAGGCANPVPALHSCASAPRGTGRVSALGPK 
TGSSPLSSPKG 


459 


1809 


A 


3664 


902 


135 


LQKYNTSMALFDFVLHNSTGEIRYITEDDVIQ 

SQNAlXJKYNTSMAU'ESNSFEICnLESPYYVD 

lJ<QTLFVQVSLHrSDlW.VVHJXrC3LASPTSD 

FASPTYDLDCSGCSRDETCKWYPLFGHYGRF 

QFNAFKFLRSMSSVYLQCKVUCDSSDHQSRC 

VNQGCVSRSKRDISSYKWKTDSnGPIRUaiDR 

SAVNGNSGFQHETHAEETPNQPFNSVHLFSFM 

V1JVL1WVTVATITVRHFVNQRADYQ\YQKLQ 

NY 


460 


1810 


A 


3670 


850 


557 


LGILMSPQVEAGEI*ALLTPPPaCMQFSPLTL/P 

K*WVSPGLTP/PPPEVPSVFLVEPGLPHAGQA 

GLDLLVTSGDPPASTSQSARITDVSHRAQPLAI 

s 


461 


1811 


A 


3671 


2472 


2099 


IGVLAFETGSCSVTKLYCIGIIMPHCSLDLAGSV 
TSAFRIAGTTSVHHHPQLTFFFFWIETGSHCV 
VQTGL*LLALSNPPALASQ1AGISGMSHRAWP 
GLVLYSa-EFSIXCASQSLIMLFTCYNE 


462 


1812 


A 


3672 


394 


110 


VKPVNGESKRD'GADTQTCEGBADEQLQTVN 
rYYD/STKSFFYISPO*K\RKPTWAFMRRT NA 
KMFGIPLHSNSDPWGYEm?EVIGFHRSRVSRG 
HGS 


463 


1813 


A 


3673 


348 


1 


QRNPFSAGHPQRPPTSGSQSELLAQPRLRPGR 

KSSFSRDQDVW* SQAVPBCRQ»QKNPFSAGHP 

QRPPTSGSQSELLAQPRLRPGRKSSFSRDQDV 

WPGQKPRPSQQQHQMCASPTLGQRSPFALEP 

VPAYHGGRDPFASARPSPVGIPKPRAAPAGG 

GWRRKPKSSTK 


464 


1814 


A 


3676 


2253 


320 


PVIQRCSQPYGFSLHSFFLKCVSETSQQPPSR 
KVFQLLPSFPTLTRSKSHESQLGNRIDDVSSM 
RFDLSHGSPQMVKRDIGLSVTHRFSnCSWLS 
QVCHVCQKSMIFGVKaKHCaUJCCHNKCrKE 

apacrisflpltrlrrtesvpsdinnpvdraab 

phfgtlpkaltkkehppamnhldsssnpsstt 

fstpsspapfptssnpssattpp\npsp\gqr\dsr 

fnfpsc/ayfihhr\q\qfifpdisafahaaplpe 

aadgtrlddqpkadvleaheaeaeepeagk 

seaeddedevddlpssrrpwrctisrkasqts 

vylqewdipfeqvhx}ep1gqgrwgrvhrgr 

whgevairij:.emdghnqdhlklfkkevmn 

YRQTTUIENVVLFMQACMNPPHLAirrSFCXO 

rtlhsfvrdpktsldinkrrqiaqbnkgmgy 

lhakgxvhkdlksrnvfydngmcwtidfglf 

\gisgvvpvegrrenqlklshdwlcylapeivr 

emtpgkdedqlpfskaadvyafgtvwyblq 

ardwplknqaaeasiwqiosgegmkrvlts 

vslokevsenlsacwafdlqerpsvfsllmd 

mleklpklnrrlshpghf*ksadinsskwpr 

ferfglgvlessnfkm 


465 


1815


A 


3679 


8 


803 


ipspawwnstwadtfsuXaLavalyLgyy 

wacao-qthrafcasnredletvvnhikhryp 

qapllavqisfoqilvlnhlaqarqaaalva 

altlsacwdsfettrsletplnsllfnqplta 

glcqlverlsy/e*dlqartirqfderytsva 

FGYQDCVTYYKAASPRTKIDAIRIPVLYLSAA 
DDPFSTVCALPKQAAQHSPYVALLITARGCaffl 
GFLEGLLPWQHWYMSRLLHQYAKAIFQDPE 
GLPDLRALLPSEDKNS 


466 


1816 


A 


3684 


3 


307 


SSQY1VQSKTKIFL*AAREKQ/RHTCRRFSIRLS 
ANISSQTGEARGQWPSVFKVLKEKKLSTKKS 
FGQK*GR\RKTFPDKQK/LREFDTTRPTIQEML 
TGVLCJG 


467 


1817 


A 


3687 


2465 


837 


ELPTPLIAAHQLYNYVADHASSYHMKPLRMA 

RPGGPEHNEYALVSAWHSSGSYLDSEGLRHQ 

DDFDVSLLVCHCAAPFEEQGEAERHVLRLQF 

FWLTSQRELFPRLTADMRBFRKPPRLPPEre 

APGSSAGSPGEASGLILAPGPAHJFPPLAAEVG 

MARARLAQLVRLAGGHCRRDTLWKRLFLLE 

PPGPDRLRLGGRLALAELEELLEAVHAKSIGD 

IDPQLDCFLSMTVSWYQSLlKVLLSKFPQSai 

HFQSPDLGTQYLWLNQKFTDCFVLVFLDSH 

LGKTSLTWFREPFPVQPQDSESPPAQLVSTY 

HHLESVINTACFTLWTRLL*GSGLDH»MSLFL 

ESWAYQ1A(X}RQD*PALLGPRASQTLSDTKG 

FVTMS*GSAAPAWQQEPPSPNTHSH*PIQDSR 

ESGQPRGPLGPFWGTPFGPPGRVSGVHTGWQ 

TPPRAPLPESCPLVPLTTVSHLCPLSLRVFTSHL 

DITAGHSHRDDTWVPIPALPUCHLRPPSSPFA 

LGPWVSHPLMRWVQKLSHLHSNPGTGFSMG 

GKQQRN 


468 


1818 


A 


3691 


960 


499 


QTCRKDKRAIYPHFQNE*MNEIKAI*SGTGGI 

QCFHSQNDSAFFFFLFLLETEFCSAA/rVQWH 

DFLSMQPPPPGFKQFTCLSLLSSWNYRRVPPPF 

PGNn»FLVKTGFPHVGQTGFELLTSSDLAPLA 

SQNGGITGMSPCAWPFFFFITFGLC 


469 


1819 


A 


3714 


4747 


495 


MAYSWQTDPNPNESHEKQYEHQEFLFVNQP 

HSSSQVSLGFDQIVDEISGKIPHYESEIDENTFF 

VPTAPKWDSTGHSLNEAHQISLNEFTSKSREL 

SWHQVSKAPAIGFSPSVLPKPQNTNKECSWG 

SPIGKHHGADDSRFSILAPSFTSLDKINLEKEL 

ENENHNYMGFESSIPPTNSSFSSDFMPKEENK 

RSGHVNIVEPSLMLLKGSLQPGMWESTWQK 

NIESIGCSIQLVEVPQSSNTSLASFCNKVKKIR 

ERYHAADVNFNSOKIWSTTTAFPYQLFSKTK 

FNIHIFIDNSTQPLHFMPCANYLVKDLIAEILH 

FCTNDQLLPKDHILSVWGSEEFLQNDHCXGS 

HKMFQKDKSVIQLHLQKSREAPGKLSRKHEB 

DHSQFYLNQLLEFMKWKVSRQCLLTLIRKY 

DFHLKYLLKTQENVYNIIEEVKKICSVLOCVE 

TKQITDAVNELSLILQRKGENFYQSSETSAKG 

LIEKVTTELSTSIYQLINVYCNSFYADFQPVNV 

PRCTSYU^GLPSHLSFTVYAAHNIPETWVHR 

INFPLEKSLPRESMLTVKLFGIACATNNANLL 

AWTCLPLFPKEKSILGSMLFSMTLQSEPPVEM 

rrPGVWDVSQPSPVTLQEDFPATGWEYMKPD 

SEENRSNLEEPLKECKHIARLSQKQTPLLLSE 

EKXRYLWFYRFYCNNENCSLPLVLGSAPGW 

DERWSEMHTILRRWTFSQPLEALGIXTSSFP 

DQEIRKVAVQQLDNLLNDELLEYLPQLVQAV 

KFEWNLESPLVQLULHRSLQSIQVAHRLYWL 

LKNAENEAYFKSWYQKLLAALQFCAGKALN 

DEFSKEQKUKILGDIGERVltSASDHQRQEVL 

KKEIGRlJEEFFQDVIITCHLPLNPAUnKGIDH 

DACSYFTSNALPLKnriNANLMOKNISnFKA 

QDDLRQDMLVLQUQVMDNIWLQEOLDMQ 

MIIYRCLSTGKDQRLVQMVPDAVnLAKIHRH 

SGUGPLKENTIKKWFSQHNHLKADYEKALR 

NFFYSCAGWCVVTFDU3VCDRHNDNIMLTKS 

GHMEHIDFGKFLGHAQTFGGIKRDRAPFIFTS 

mvI\EYFrrEGG\KNPQHFQDFV\ELCCRAYNIIR 

KHSQLLL\NLL\EMMLYAO\LFELSQI\QDLKY 

VYNNLRPQDTDLEATSHFTKKIKESLECFPVK 

U^NUHTIAQMSAISPAKSTSQTrFPQESCLLST 

TRSIERATILGFSKKSSNLYUQVTHSNNETSL 

TEKSFEQFSKLHSQLQKQFASLTLPEFPHWW 

HIJPFTOSaJHRRFRDLNHYMEQILNVSHEVTO 

SDCVLSFFLSEAGQQTVEESSPVYLGEKFPDK 

KPKVQLVISYEDVKLTILVKHMKNIHLPDGSA 

PSAHVEFYlXFini^VIUWKTKSVPKCTDPTY 

NEIWYDEVTELQGHVLMLIVKSKTVFVGAI 

NIRLCSVPLDKEKWYPLGNSn*Pm,FSSFGM 

KSLEKDEFVGGMLLSNPIW 


470 


1820 


A 


3718 


430 


75 


SHGSISILNLHQGCVFLPSU'AQGLRCYRCLA 
VLEGASCSWSCPFUXjVCVSQKVSV/CWQ*/ 
OWGARAEGRLSAWDSQISCXaCGDLCNAV 
VLAAGSPWALCVQLLLSLGSVFLWALL 


471 


1821 


A 


3723 


891 


494 


LRQSL/NSVPQAGVQWRDSSLQAPPPRFTPLS 

CXSLPSSWD'5nRJRLPPCLANFLVF**RRGFmL 

ARMVLIS*PRDPPASASQ\SrarrGGSHRAQHP 

TDSRDHSERSVKKSHEVISELRMKVIKCKVAF 

SKNPI 


472 


1822 


A 


3734 


443 


251 


GFlKr*NFCVSKDTSKKLS/RLPTKWKNVFAN 
♦ISDKGLVSRICQELLRHLDAEQVSSTAOLSL 


473 


1823 


A 


3746 


3 


300 


THASGGARSGAGWAGRGVRAGTEAGRGGIF 

LTLSILRTRDLPSGAMSEGVDLIDIYADEEFNQ 

DPEFNNTDQIDLYDDVLTATSQPSDDRSSSTE 

PPPPVRQEPSPKPNNKTPAILYTYSGLRNRRA 

AVYVGSFSWWTTDQQUQVIRSIGVYDVGEV 

KFAENRAK 


474 


1824 


A 


3733 


2 


5262 


RPLFAREGGIYAVLVCMQEYKTSV\LVQQAG 

LAALKMLAVASSSEIPTFVTGRDSIHSLFDAQ 

KfTREIFASIDSAraPGSESLLLTVPAAVILMLN 

TEGCSSAAlWGUjXNLLLCNHHTLGDQirrQ 

H.RDmTmSGIAPRTEPNflTraTILMMLX,NR 

YSEPPGSPVERAALETPIIQGQDGSPELLIRSLV 

GGPSAELLLDLERVLCREQSPGGAVRPLLKRL 

QQETQPFIXLLRTLDAPGPNK7LLLSVLRVIT 

RLLDFPEAMVLPWHEVLEPCLNCLSGPSSDSE 

IVQBLTCFLHRLASMHKDYAWLCCLGAKEI 

LSKVLDKHSAQUXOCHLRDLVTECEKYAQL 

YSNLTSSILAGCIQMVLGQIEDHRRTHQPINIP 

FFDVFLRHLCXJGSSVEVKEDKCWEKVEVSSN 

PHRASKLTDHNPKTYWESNGSTGSHYITLHM 

HRGVLVRQLTLLVASEDSSYMPARVWFGG 

DSTSaGTELNTVNVMPSASRVILLENLNRFW 

PHQIRIKRCQQGGIDTRVRGVEVLGPKPTFWP 

LFREQLOUlTCLFYnRAQAWSRDlAEDHRRL 

LQLC3>RLNRVLRHEQNFADRFLPDDEAAQAL 

GKTCWEALVSPLVQNITSPDAEGVSALGWLL 

DQYLEQRETSRNPLSRAASFASRVRRLCHLL 

VHVH»PPGPSPEPSTRPFSKNSKGRDRSPAPSP 

VLPSSSLRNITXJCAVLSVVQEQVSRFLAAAWR 

APDFVPRYCKLYEHLQRAGSBLFGPRAAFML 

ALRSGFSGALLQQSFLTAAHMSEQFARYIDQ 

QIQGGLIGGAPGVEMLGQLQRHLEPIMVLSG 

IJHATTITEHFYQHYMADELLSFGSSWLEGAV 

LEQIGLCFPNRLPQLMLQSLSTSEELQRQFHLF 

QLQRLDKLFLEQEDEEEKRL*EEEEEEEEEEA 

EKELFIEDPSPAISILVLSPRCWPVSPLCyLYHP 

RKOJTEFCDALDRFSSFYSQSQNHPVLDMG 

PHRRLQWTWLGRABLQFGKQILHVSTVQMW 

LIXKFNQTEEVSVETLLKDSDLSPEUiQALV 

PLTSONGPLTLHEGQDFPHGOVLELHEPOPQ 

RSGEALWUPPQAYLNVEKDEGRTLEQKKNL 

LSCLLVRILKAHGEKGLHIDQLVCLVLEAWQ 

KGPNPPGTLGHTVAGGVACrSTDVLSCILHLL 

GQGYVKRRDDRPQILMYAAPEPMGPCRGQA 

DVPFCGSQSETSKPSPEAVATLASLQLPAGRT 

MSPQEVEGLMKQTVRQVQETLNXJEPDVAQH 

IXAHSHWGAEQLLQSYSEDPEPLLLAAGLCV 

HQAQAVPVRPDHCPVCVSPLGCDDDLPSLCC 

MHYCCXSCWNEYLTTRIEQNLVLNCTCPIAD 

CPAQPTGAFIRAIVSSPEVISKYEKAIXRGYVE 

SCSNLTWCTNPQGCDRILCRQGLOCGTTCSK 

CGWASCFNCSFPEAHYPASCGHMSQWVDDG 

GYYDGMSVEAQSKHLAKLISKRCPSCQAPIE 

lOJEGCLHMTCAKCNHGFCWRCIJCSWKPNH 

KDYYNCSAMVSKAARQEKRFQDYNBRCTFH 

HQAREFAVNLBNRVSAIHEVPPPRSFTFLNDA 

CQGLEQARKVLAYACVYSFYSQDAEYMDW 

EQQTENLELHTNALQIIXEETLLRCRDLASSL 

RLLRADCLSTG1S4ELLRR1QERLLAILQHSAQD 

FRVGLQSPSVEAWEAKGPNMPGSQPQASSGP 

EAEEEEEDDEDDVPEWQQDmDEELDNDSFS 

YDESENLDQEITFFGDEEEDEDEAYD 


475 


1825


A 


3754 


1093 


96 


GTSRNQHSPKTHA'RSS/WPQPPPLELPPLQPQ 

ATGRRRRRTRTQQRTAALLTDGTTKTGAAW 

SRRPSLCWPSRTrGAPGAK*AVLVRSATPTTN 

PPNPQSPTGAAGKLRAPOCIAG/SEPSSQEPPP 

DGTRVRPASTTGVAQSPATRATPSLPCLHVPAP 

SRGQTLGVRTTGRASRLTVDRSRLSWPGRSA 

RSGGGRWRPNAPRORWPRAP*SWEPGSWTE 

PWRWPFPAAESPPHRCIYCTNHVSPAGPARPS 

HVYIIRATINSISHPLCRAQSSPWEAAGVWRR 

PAQPAFTSDVNINLUtKPRVKRHDLIYQFLGN 

TLWEEGRQRPPETLQPAR 


476 


1826 


A 


3758 


901 


521 


FFF0NGVSPCPQAGV*WHDUDSLQNIJPGF1C 
RFSYLSLPSSW\DYRHVPHIQANFCIF/M*RRG 
FTMLARMVSIS*PRDLPALASQSAGrrGVSHH 
APPQMDFTFALLCFAPKGOJPRQKEGGTI-NU 


477 


1827 


A 


3761 


843 


575 


GVtSAHCNLRUCHLPGSSNSPASASQVAGTlG 

ART1TS*IFVFLVETGFHHVSQDGLDLL/NFVI 

RPRRPLKVLGU3ACTRARLPSPLKEL 


478 


1828 


A 


3763 


267 


1240 


HLLSFHLWSASLDCLEQLSQERHVKGMLLGP 

PPVNESTKFSPSPWKLTPPMCSIPPVFPPKSGS 

PTTSWS/PSGHSKLEVERAQTGPFCLHIYCP*P 

GVTDNTTSLLHY1PFPRL\SGLVCFPAH*FPSY 

WTOHSFASQAWLRQVPEVSKHLQCPSAESLL 

TMEYHQPEDPAPCHCAGTAEAVIPBNHEVLAG 

PDEHPQDTDARDADGEAREREP/RRPSFAA»P 

VWGQP\ESPLPEASSAPPGPTLGTLPEVETIRA 

CSMPQELP*SPRTRQPEPDFYCVKW1PWKGE 

QTPnTQSTNGPLPSPCHHEHPLSSVEGEAPPA 

EGSDHIG 


479 


1829 


A 


3766 


2 


21S2 


YSPIRLLEVCVPLPKIFIKRQAPLKVSLLQDLK 

DFFQKVSQVYVAroERLASLKTDlPSKTREEK 

MEDIFAQKEMEEGEFKNWIEKMQARLMSSS 

VDTPQQLQSVFESLIAKKQSLCEVLQAWNNR 

LQDLFQQEKGRKRPSVPPSPGRLRQGEESKIS 

AMDASPKNISPGLQNGEKEDRFLTTLSSQSST 

SSTHLQLPTPPEVMSEQSVGGPPELDTASSSE 

DVFDGHLLGSTDSQVKEKSTMKAIFANLLPO 

NSYNPIPFPFDPDKHYLMYEHERVPIAVCEKE 

PSSnAFALSCKEYRNALEELSKATQWNSAEE 

GLPTNSTSDSRPKSSSPIRLPEMSGGQTNRTTE 

TEPQPTKKASGNtLSFFRGTAGKSPDLSSQKRE 

TLRGADSAYYQVGQTGKEGTENQGVEPQDE 

VDGGDTQKKQLINPHVHXJFSDAMAKFyCRL 

YYAGEFHKMREVnJDSSEEDFmSLSHSSPWQ 

ARGGKSGAAFYATEDDRFUXQMPRLEVQSF 

LDFAPHYFNYITNAVQQKRPTALAKILGVYRI 

GYKNSQNNTEKKLDLLVMENLFYGRKMAQ 

VFDLKGSLKNKNVKTDTGKESCDWLLDENL 

LKMVRDNPLYIRSHSKAVLRTSIHSDSHFLSS 

HLIIDYSLLVGRDDTSNELWOIIDYIRTFTWD 

KKLEMVVKSTGILGGQG*MPTWSPELYRTR 

FCEAMDNYFLMVPDHCTGLGLNC 


480 


1830 


A 


3777 


251 


3 


QGCGSAGTLIHY**ECKMVQLLWKTV*QFLI 

KLNrKDPAnXDVYPNEVKNYVRTKTYTQMF 

VANFIMAKSWKQPTHPSVRT 


481


1831 


A 


3779 


333 


3 


EAAIRQPEPNILDVNQIFKDLAMIIHDQGDLID 
SDEANAESSEVLVERAPGQLQRPAXYYQKKSR 
KKMCLVVLVQTAnLICERIM*VVYTrKWSPPI 
VLPVSCFQGQKFN 


482 


1832 


A 


3780 


2 


371 


TGGRQGKNDHTSITEKPSRDFNRHUTQNI'M 
PNQDMKSSSNSLIIRKVQIKPm.YHHIFTRKA 
KMKTTDKTKYR'GFKAmiJHCSQDCKLQ'S 
/L'ENHFMIFPKAEQHrrYDTnPFLR 


483 


1833 


A 


3787 


43 


448 


LMKDLSPYVMETHYILNRLNER/RSMWRHnG 

KLPNTKDQEKILKAIRGRREVIQGS/RQQYRR 

PAAFSAAEKARRLWCS/VFNIERRNL/CEYPTK 

LSFNKGEMTFSDKTEFTTNRPSLKMLLKDRI 

QEEGJai4F»KEKCTKRKE 


484


1834 


A 


3798 


1 


727 


l-FJ-Hi-rESRSVAQAGVQWCNLGSLQALPPGF\ 

SHSPASASRVAGTTGTRH'ARUFYIFSRDGVS 

PC*PGWS*SPDLVIRPP\RLPKCWbYRREPPRP 

A*FFVFLVE\QGFTMLARMVSIS*PQ/CI>LPAS 

VSQNAGITGVSHCAWPCXHFCFFGFFFrailESC 

SVAQAEVQWHDLRSLQAPPPGFTPFSCLSLPG 

SWDYRRPPPRPANFNCTSRDGVSPC'PGWSRS 

PDLVIRPPRPPKVLGLQA 


485 


1835


A 


3802 


1 


239 


FFFFEMECLTVSQAGVQWYNLHSLQPLPPGF 

KQFSC\LSLPSSWD*RVPTSEPAKF/CVIF*DGV 

SHCQPGWSAWQPPLH 


486 


1836 


A 


3811 


378 




MSIMKKKSIFSAEFYKVSLPSLLLVHLLAIEWG 
FfflEIQLTIHQHFLNYELESDFVfflVEYM 


487 


1837 


A 


3814 


771 


320 


FDPDWTRAAGIRHEKKPKALAYRRENSPGDL 

PPPPLPPPEEEASWAL/GAEGSRQHVLPGAGA 

QWGffiSGPGRAPGSPAGAPPR»RGLAPVNSRP 

SFLSRGQGTSTCSTAGSNSSRGSSSSROSRGPO 

RSRSRSQSRSQSQRPGQKRRmTl 


488 


1838 


A 


3818 


1 


781 


FRACLLEUPYAPTLSWTACPPAMAGPRGLLP 

UaJAFOAGFSFVRGQVUKGCKVKTTFVT 

HVPCTSCAADCKQTCPSGWLRELPDQrrQDCR 

YEVQLOGSMVSMSGCRRKCRKQWQKACCP 

GYWGSRCHECPGGAETPCNGHGTCLDGMDR 

NGTCVCQENFRGSACQECQDPNRFGPDCQSV 

CSCVHGVCNHGPRGDGSCLCFAGYTGPHCD 

QELPVWQELGFP(?WPRLRKAPNC3KCLPG*H 

RNGLIATPNPCRP 


489 


1839 


A 


3822 


934 


669 


I'ffSKMESRSVTRLECSGAISAHLRLLGSSNSP 
ASAS*VAGTIGACHHAQUFVFLVETGFHHVG 
QDGLDLL/NLMDffPRPPKVLGFQA 


490 


1840 


A 


3825 


79 


9748 


GCQSCWPAWPRLRRRGPASAGARLGRKAPW 

GLPGRVQDGRPLRFCFYLRPRAPEIAPVLSGA 

ASRPEASGDCRAGRETAMATLEKUvIKAFESL 

KSFQQQQQQQQQQQQQQQQQQQQQQQPPPP 

PPPPPPPQLPQPPPQAQPLLPQPQPPPPPPPPPP 

GPAVAEEPLHRPKKELSATKKDRVNHCLTIC 

ENIVAQSVRNSPEFQKIXGIAMELFLLCSDDA 

ESDVRMVADECLNKVIKALMDSNLPRLQIJEL 

YKEKKNGAPRSLRAALWRFAELAHLVRPQK 

CRPTfLVNLLPCLTRTSKRPEESVQETLAAAVP 

KIMASFGNPANDNEIKVLLXAFlANLKSSSPn 

RRTAAGSAVSICQHSRRTQYFYSWLLNVLLG 

LLVPVH5EHSTLLILGVLLTLRYLVPLLQQQV 

KDTSLKGSFGVTRKEMEVSPSAEQLVQVYEL 

TLHHTQHQDHNWTGALELLQQLFRTPPPEL 

LQTLTAVGGIGQLTAAKEESGGRSRSGSIVELI 

AGGGSSCSPVLSRKQKGKVLLGEEEALEDDS 

ESRSDVSSSALTASVKDEISGELAASSGVSTPG 

SAGHDIITEQPRSQHTLQADSVDLASCDLTSS 

ATDGDEEDILSHSSSQVSAVPSDPAMDLNDG 

TQASSPISDSSQTTTEGPDSAVTPSDSSEIVID 

GTDNQYLGLQIGQPQDEDEEATGILPDEASEA 

FRNSSMALQQAHLLKmiSHCRQPSDSSVDKF 

VLRDEATEPGDQENKPCRIKGDIGQSTODDS 

APLVHCVRLLSASFLLTGGKNVLVPDRDVRV 

SVKALALSCVGAAVALHPESFFSKLYKVPLD 

TTEYPEEQYVSDILNYIDHGDPQVRGATAILC 

GTUCSILSRSRFHVGDWMGnRTLTGNrFSL 

ADOPLUIXTLKDESSVTCKLACTAVKNCVM 

SLCSSSYSELGLQUIDVLTLRNSSYWLVRTEL 

LETLAEDDFRLVSFLEAKAENUHRGAHHYTGL 

LKLQERVLNNWIHLLGDEDPRVRHVAAASL 

IRLVPKLFYKCDQGQADPWAVARDQSSVYL 

KLLhfflETQPPSHFSVSTrnOYRGYNLLPSrrD 

VTMENNl^RVlAAVSHELrrSTTRALTFGCCE 

ALCLLSTAFPVCIWSLGWHCGVPPLSASDESR 

KSCTVGMATMIL11LSSAWFPLDLSAHQDAL 

ILAGNLLAASAPKSLRSSWASEEEANPAATK 

QEEVWPALGDRALVPMVEQLFSHLLKVINIC 

AHVLDDVAPGPADCAALPSLTNPPSLSPIRRK 

GKEKEPGEQASVPLSPKKGSEASAASRQSJDTS 

GPVTTSKSSSLGSFYHLPSYLKLHDVLKATHA 

NYKVTLDLQNSTEKFGGFLRSALDVLSQILEL 

ATLQDIGKCVEEILGYLKSCFSREPMMATVC 

VQQIXKTLFGTOLASQFDGLSSNPSKSQGRA 

QRLGSSSVRPGLYHYCFMAPYTHFTQALADA 

SLRNMVQAEQENDTSGWFDVLQKVSTQLKT 

NLTSVTKNRADKNAIHNHIRLFBPLVIKALKQ 

yiinCVQLQKQVLDLLAQLVQLRVNYCLL 

DSDQVFIGFVLKQFEYIEVGQFRESEAIIPNIFF 

FLVLLSYHlYHSKQnGIPKIIQLCDGIMASGR 

KAVTHAIPALQPIVHDLFVLRGTNKADAGKE 

LETQKEVWSMLLKUQYHQVLEMFILVLQQ 

CmENEDKWKRLSRQIADIILPMLAKQQMHI 

DSHEALGVUHLFEILAPSSLRPVDMLLRSMF 

VTPNTMASVSTVQLWISGILAILRVUSQSTED 

IVLSRIQELSFSFYLISCTVINRLRDGDSTSTLE 

BHSEGKQDCNLPEETFSRFLLQLVGILLEDIVT 

KQIXVEMSEQQHTFyCQElXJTLLMCUHIFKS 

GMFRRITAAATRLFRSDGCGGSFYTLDSLNLR 

ARSIVffTTHPALVLLWOQIUXVNHTDYRWW 

AEVQQTPKRHSLSSTKLLSPQMSGEEEDSDLA 

AKLGMCNRETVRRGALILFCDYVCQNLHDSE 

HLTWUVNfflQDLISLSHEFPVQDFlSAVHRNS 

AASGmQAIQSRCENLSTPTMLKKTLQCXEGI 

HLSQSGAVLTLYVDRLLCTPFRVLARMVDIL 

ACRRVEMLLAANLQSSMAQLPMEELNRIQEY 

LQSSGLAQRHQRLYSLLDRFRLSTMQDSLSPS 

PFVSSHPLDGDGHVSLETVSPDKDWYVHLVK 

SQCWTRSDSALLEGAELVNRIPAEDMNAFM 

MNSEFNLSIXAPCLSLGMSEISGGQKSALFEA 

AREVTLARVSGTVQQIPAVHHVFQPELPAEP 

AAYWSKLNDLFGDAALYQSLPTLARALAQY 

LVWSKLPSHLHLPPEKEKDIVKFVVATLEAL 

SWHUHEQIPLSLDLQAGLDCCCLALQLPGL 

WSWSSTEFVTHACSLIYCVHFILEAVAVQPG 

EQLLSPERRTNTPKAISEEEEEVDPNTQNPKYI 

TAACEMVAEMVESLQSVLALGHKKNSGVPA 

FLTOiRNIIISLARLPLVNSYTRVPPLVWKLG 

WSPKPGGDFGTAFPEIPVEFLQEKEVFKEFIYR 

1NTLGWTSRTQFEETWA71XGVLVTQPLVME 

QEESPPEEDTERTQINVLAVQAITSLVLSAMT 

VPVAGNPAVSCLEQQPRNKPLKAUDTRFGRK 

LSnRGIVEQEIQAMVSKRENIATHHLYQAWD 

PVPSLSPATTGALISHEKLLLQINPERELGSMS 

YKLGQVSIHSVWLGNSITPLREEEWDEEEEEE 

ADAPAPSSPPTSPVNSRKHRAGVDIHSCSQFL 

LELYSRWILPSSSARRTPAILISEVVRSLLWS 

DLFTBRNQFELMYVTLTELRRVHPSEDEILAQ 

YLVPATCKAAAVLGMDKAVAEPVSRLLESTL 

RSSHLPSRVGALHGVLYVLECDLLDDTAKQL 

IPVISDYLLSNLKGIAHCVNIHSQQHVLVMCA 

TAFYLIENYPLDVGPEFSASnQMCGVMLSOS 

EESTPSnYHCALRQl FRI 1 1 SPni SRI TIAFW 

VKLSVDRVNVHSPHRAMAALGLMLTCMYT 

GKEKVSPGRTSDFNPAAPDSESVTVAMERVS 

VLFDRKKGFPCEARWARILPQFLDDFFPPQ 

DIMNKVIGEFLSNQQPYPQFMATWYKVFQT 

LHSTGQSSMVRDWVMLSLSNFTQRAPVAMA 

TWSLSCarVSASTSPWVAAELPHVISRMGKLE 

QVDVNLFCLVATDFYRHQIEEELDRRAFQSV 

XJBWAAPGSPYHRLLTCLRNVHKVTTC 


491 


1841 


A 


3826 


469 


302 


SNPPASASRVAGrroVHQHAWUFVFLVHvlEF 
HHVGQAVLKLUS6DLPVSASQSA 


492 


1842 


A 


3836 


392 


88 


VAPSPMIMPDLYFYRDPEE1EKEE*AAAEK\EE 
FQSEWTAWyP/EFTATQSEVADWFKDMQVP 
SVPIQQFPTEDWST*PTMNDWSATSTAQTTE 
WVRITTEWP 


493


1843 


A 


3838 


19 


380 


TPSDMNRAFETDTQSIGEKNRSPSEPDYFERK 
KnCRS*HCAHlRYKIDQPEDIPLK\EFLCKHSK 
CTATLSMRNMSLMKKKCSFSEEF\LAFFPS1X 
VCmLAKLGFYIBIHLTrFNNTF 


494 


1844 


A 


3845 


2 


352 


l-l-fLRRSL/DSVAQAEAQWLVELGLLQAPPPGF 
KPISLP\(HJPSSWDYGRPPPa>ANFCIF/M*RRG 
FTVLARMVIJS*PCDPPTLASQGTArrGMSYH 
ARPQDIDFLYAHQGRCWFRLL 


495 


1845 


A 


3847 


1774 


40 


Dn<>RRAKEGMGQDEAQFSVEMPLTGKAYL 

WADKYRPRKPRFFNRVHTGFEWNKYNQTHY 

DFDNPPPKIVQGYKFNIFYPDLIDKRSTPEYFL 

EACADNKDFAILRFHAGPPYEDIAFKrVNREW 

EYSHRHGFRCQFANGIFQLWFHFKRYRYRR* 

RPWGTAGRCPRGHSKGASVKLWTPGPLSGL 

QGRGFTSHLRPHLSFARPQFPPI*KGGHH*AC 

HGELRRHWDRLA*GPDATEGALGASFEHEG 

GQQPPADLTVQADTLHRPSARLGGAHRACPK 

RRPHRVLWRWARGAWAWRCQAREKQETQG 

QPCHITGHPLGREAEPAAAGAAPALAHRPPF 

ARTGSTEVPGPCWRPIRHCRRDPLWrPTLCVRD 

WPPTHPVLAGGVHFPAAG/IGGCVEVPVSVN 

VMGTKSH'AVLPPPPSTGPGGQGLPEGWGLE 

KGEGLPPGIPPPGLLTGPW\SMRPVTPSFAHIR 

TVAPSHSPFSGQEGRGPHGCHSPGR\SGP\AGR 

LVLQHPTGTSPTEAKRKVPPGPPEGHPTSPVT 

SPRPPTAPPRHPASSGNSSVCFSKKTCRWEKK 

SFVLMEiAYWQDRMFF 


496 


1846 


A 


3849 


830 


442 


AKSPLPLG*IQWR/NLGSLKUUJPGFK*FTCXO 
IXSSWDYRSIJPPRPVNFCILVELGFHHVDQAG 
LKLLTSSALPALASQSAEITGMSHRIWPLPLLR 
lUWIRIRAPPQRLPFNLrrSLKALSPNMATF 


497 


1847 


A 


3859 


2 


393 


ALRKTRRDGIARTGAQPAASWKGTNNYPWR 

LHvIAGRPGSQEQSKDRGTGSLPPPSQRPLGPS 

PEQAGPSPPPPOIPRGGGSSSSEQP/PQULFVPR 

RFPAPKKOLPSDTPHSKAPPTPHLILGGEDSQ 

VPIL 


498


1848 


A 


3860 


253 


634 


knastvyssqgdpksffflLrwslalvaqag 
eq*rdlsslqfpppgfk*fsclslpsswd\yrc3p 

LPCXANR'FLVETGFHHVGQADLKLLTSGDP 

ptsasesagitgvshrawprihflywktffl 


499 


1849 


A 


3863 


423 


263 


•apsqisvaflyaa/dkxfhcei»kkipfiias/dki 

K^GI^a,TKEVKYLYTENymA^KEIK/DTDKW 
KDILr*WIGKINl«KMSTPPKAIYRFNAIPTKlP 

mtffteieksiikfiwnhkkppntqsnieqke*s 
fcsillwvfooflwfhmnfmidfsisvknvigi 
lvgialnl 


500


1850 


A 


3865 


2 


13246 


lprgclwclqrspiparpqpsrparsplplfp 

dlrpwasdu3imgdaegedevqflrtddev 

vujcsatvlkeqlklclaaegfgnrlcflep 

tsnaqnvppdlaiccfvleqslsvralqeml 

antveagvessqggghrtllyghaiixrhah 

srmylsclttsrsmtdklafdvqlqedatge 

acwwtmhpaskqrsegekvrvgddiilvsvs 

serylhlstasgelqvdasfmqilwnmnpic 

srceegfvtgghvlruhghmdecltispads 

ddqrrlvyyegqavctharslwrleplris 

wsgshlrwgqplrvrhvttgqylaltedqg 

lvwdaskahtkatsfcfriskekldvapkr 

dvegmgppekygeslcfvqhvasglwltya 

APDPKALRLGVLKKKAMLHQEGHMDDALSL 

TRCXJQEESQAARMIHSTNGLYNQFIKSLDSFS 

GKPRGSGPPAGTALPIEGVILSLQDUIYFEPPS 

EDLQHEEKQSKLRSLRNRQSLFQEEGMLSMV 

LNCIDRLNVYrTAAHFAEFAGEEAAESWKEI 

VNLLYELLASLmONRSNCALFSTNLDWLVS 

KLDRLEASSGILEVLYCVLIESPEVLNnQENHI 

KSnSLLDKHGRNHKVLDVLCSLCVCNGVAV 

RSNQDLITENLLPGREUXQTNLINYVTSIRPN 

IFVGRAEGTTQYSKWYFBVMVDEVTPFLTAQ 

ATHLRVGWALTEGYTPYPGAGEGWGGNGV 

GDDLYSYGFDGLHLWTOHVARPVTSPGQHL 

LAPEDVISCCLDLSVPSISFRINGCPVQGVFESF 

NLDGLFFPWSFSAGVKVRFIXGGRHGEFKF 

LPPPGYAPCHEAVLPRERLHLEPKEYRJREGP 

RGPHLVGPSRCLSHTDFVPC3PVDTVQIVLPPH 

LERIREKLAENIHELWALTRIEQGWTYGPVRD 

DNKRLHPCLYDFHSLPEPERNYNLQMSQETL 

KTLLALGCHVGMADEKAEDNLKKTKLPKTY 

MMSNGYKPAPLDLSHVRLTPAQTTLVDRLAE 

NGHNVWARDRVGQGWSYSAVQDIPARRNPR 

LVPYRLLDEATKRSNRDSLCQAVRTLLGYGY 

NDEPPDQEPSQVENQSRCDRVRIFRAEKSnV 

QSORWYFEFEAVTTGEMRVOWARPELRPDV 

ELGADELAYVFNGHRGQRWHLGSEPFGRPW 

QPGDWGCMIDLTENTIIFTLNGEVLMSDSGS 

ETAFREIEIGDGFLPVCSLGPGQVGHLNLGQD 

VSSLRFFAICGLQEGFEPFAINMQRPVTTWFS 

KGLPQFEPVPLEHPHYEVSRVDGTVDTPPCLR 

LTHRTWGSQNSLVEMLFLRI^LPVQFHQHFR 

CTAGATPLAPPOLQPPAEDEARAAEPDPDYE 

NLRRSAGGWSEAENGKEGTAKEGAPGGTPQ 

AGGEAQPARAENEKDATTEKNKKRGFLFKA 

KKVAMMTQPPATPTLPRLPHDVVPADNRDD 

PEnLNTTTYYYSVRVFAGQEPSCVWAGWVT 

PDYHQHDMSFDLSKVRWTVTMGDEQQNV 

HSSaLKCSNCYMVWGODFVSPOQQGRISHrDL 

VIGCLVDLATGUVlTFTANGKESNTFFQVEPN 

TKIPPAVFVLPTHQNVIQFELGKQKNIMPLSA 

AMFQSERKNPAPQCPPRJUSMQMLMPVSWSR 

MPNHFLQVETRRAGERLGWAVQCQEPLTMM 

ALHIPEENRCMDILELSERLDLQRFHSHTLRL 

YRAVCALGNNRVAHALCSHVDQAQLLHALE 

DAHLPGPLRAGYYDLLISIHLESACRSRRSML 

SEYIVPLTPETRArrLFPPGRSTENGHPRHGLP 

GVGVTTSLRPPHHFSPPCFVAALPAAGAAEAP 

ARLSPAIPLEALRDKALRMLGEAVRDGGQHA 

RDPVGASVEFQFVPVLKLVSTLLVMOIF(H)E 

DVKQILKMIEPEVFTEEEEEEDEffiEGEEEDEE 

EKEEDEEETAQEKEDEEKJ^EEAAEGEKEEG 

LEEGLLQMKLPESVKLQMCHLLEYFCDQELQ 

HRVESLAAFAERYVDKLQANQRSRYGIXKA 

r oM 1 AAn 1 AKK 1 KnrKorrQiiQiNMlJUQFKDG 

TDEEDCPLPEEIRQDLLDFHQDLLAHCGIQLD 

GffiEEPEEETTUjSRIAlSLLEKVRLVKKKEEK 

PEEERSAJffiSKPRSLQELVSHMWRWAQEDF 

VQSPELVRAMFSLLHRQYDGLGELLRALPRA 

YTISPSSVEDTMSLLECLGQIRSLLIVQMGPQE 

ENUvflQSlGNIMNNKVFYQHPNlMRALOMHE 

TVMEVMVNVLCXJGESKEIRFPKMVTSCCRFI. 

CYFCRISRQNQRSMFDHLSYLLENSGIGLGM 

QGSTPLDVAAASVIDNNELALALQEQDLEKV 

VSYLAGCGLQSCPMLVAKGYPDIGWKPCGG 

ERYLDFLRFAVFVNGESVEENANWVRLUR 

KPECFGPALRGEGGSGLLAAIEEAIRISEDPAR 

DGPGIRRDRKREHFGEEPPEENRVHLGHAIMS 

FYAAUDLLGRCAPEMHUQAGKGEALRIRAl 

LRSLVPLEDLVGnSLPLQIFTLGKDGALVQPK 

MSASFVPDHKASMVLFLDRVYGIENQDFLLH 

VLDVGFLPDMRAAASLDTATFSTTEMALAV 

NRYLCLAVLPLmCCAPLFAGTEHRAIMVDS 

MUmnrRLSRGRSLTKAQRDVIEDCLMSLCR 

YIRPSMLQHLLRRLVFDVPILNEFAKMPLKLL 

TNHYERCWKYYCLPTGWANFGVTSEEELHL 

TRKLFWGIFDSLAHKKYDPELYRMAMPCLC 

AIAOALPPDYVDASYSSKAEKKATVDAEGNF 

DPRPVETLNVnPElCLDSFINKFAEYTHEKWAF 

DKIQNNWSYGHsHDEELIClHPMLRPYKTFSE 

KDKEIYRWPDCESLKAMIAWEWTIEKAREGE 

EEKTEKKKTAKISQSAQTYDPREGYNPQPPDL 

SAVTLSRELQAMAEQLAENYHNTWGRKKKQ 

ELEAKGGGTHPLLVPYDTLTAKEKARDREKA 

QELLKFLQMNGYAVTRGLKDMELDSSSIEKR 

FAFGFLQQLLRWMDISQEFIAHLEAWSSGRV 

EKSPHEQEIKFFAKILLPLINQYFTNHCLYFLS 

TPAKVLGSGGHASNKEKEMTTSLFCKLAALV 

RHRVSLFGTDAPAWNCLHILARSLDARTVM 

KSGPEIVKAGLRSFFESASEDIEKMVENLRLQ 

KVSQARTQVKGVGQNLTYTTVALLPVLTTLF 

QHIAQHQFGDDVILDDVQVSCYRTLCSIYSLG 

rnornrVEKLRPALGECXARLAAAMPVAFLE 

PQLNEYNACSVYTIXSPRERAILGIJT4SVEEM 

CTDIPVLERLMADIGGLAESGARYTEMHIVIE 

riLPMLCSYLPRWWEROPEAPPSALPAGAPPP 

CTAVTSDHLNSLLGNILRIIVNNLGIDEASWM 

KRIAWAQPIVSRARPELLQSHFIPTIGKLRKR 

AGKWSEEEQLALEAKAEAQEGELLVRDEFS 

VLCRDLYALYPLLIRYVDNNRAQWLTEPNPS 

AEELFRMVGEIFIYWSKSHNFKREEQNFWQ 

NEINNMSFLTADNKSKMAKAGDIQSGGSDQE 

RTKKKRRGDRYSVQTSLIVATLKKMLPIGLN 

MCAPTDQDLITLAKTRYALKDTDEEVREFLH 

NNLHLQGKVEGSPSLRWQMALYRGVPGREE 

DADDPEKIVRRVQBVSAVLYYLDQTEHPYKS 

lOCAVWHKLLSKQRRRAVVACFRMTPLYNLP 

THRACmffLESYKAAWILTEDHSFEDRMIDD 

LSKAGEQEEEEEEVEEKKPDPLHQLVLHFSRT 

ALTEKSKLDEDYLYMAYADIMAKSCHLEEG 

GENGEAEEEVEVSFEEKQMEKQRLLYQQARL 

mRGAAEMVLQMISACKGETGAMVSSTLKL 

GISILNGGNAEVQQKMLDYLKDKKEVGFFQS 

IQAIMQTCSVLDLNAFERQNKAEGLGMVNE 

DGTVINRQNGEKVMADDHTXJDLFRFLQLLC 

EGHNNDFQNYLRTQTGNrrnNinCTVDYLL 

RLQESISDFYWYYSGKDVIEEQGKBNFSKAM 

SVAKQVFNSLTKYIQGPCTGNQQSLAHSRLW 

DAWGFLHVFAHMMMKLAQDSSQIELLKEL 

LDLQKDMVVMLLSLLEGNVVNGMIARQMV 

DMLVESSSNVEMILKFFDMFLKLKDIVGSEAF 

QDYVTDPRGLISKKDFQKAMDSQKQFSGPEI 

QFLLSCSEADe>(EMINCEEFANRFQEPARDIG 

FNVAVLLTNLSEHVPHDPRLHNFLELAESILE 

YFRPYLGRIEIMGASRRIERIYFEISETNRAQW 

EMPQVKESKRQFIFDWNEGGEAEKMELFVS 

FCEDTIFEMQIAAQISEPEGEPETDEDEGAGA 

AEAGAEGAEEGAA(a.EGTAATAAAGATARV 

VAAAGRALR(aSYRSLRRRVRELRRLTAREA 

ATAVAALLWAAVTRAGAAGAGAAAGALGL 

LWGSLFGG<a:.VEGAKKVTVTELLAGMPDPT 

SDEVHGEQPAGPGGDADGEGASEGAGDAAE 

GAGDEEEAVHEAGPGGADGAVAVTDGGPFR 

PEGAGGLGDMGDTTPAEPPTPEGSPILKRKLG 

ATXjVEEELPPEPEPEPEPELEPEKADAENGEK 

EEVPEPTEEPPKKQAPPSPPPKKEEAGGEFWG 

ELBVQR\TCFLNYLSRNFYTLRFLALFLAFAIN 

FDLLFYKVSDSPPGEDDMEGSAAGDVSGAGS 

GGSSGWGLGAGEEAEGDEDENMVYYFLEES 

TGYMEPALRCa-SLLHTLVAFLCnGYNCLKVP 

LVIFKREKELARKLEFDGLYITEQPEDDDVKG 

QWDRLVLNTPSFPSNYWDKFVKRKVLDKHG 

DIYGRERIAELLGMDLATLEITAHNERKPNPP 

PGLLTWLMSIDVKYQIWKFGVIFTDNSFLYLO 

WYMVMSLLGHYNNFFFAAHLLDIAMGVKTL 

RTILSSVTHNGKQLVNITVGLLAVVVYLYTVV 

AFNEFRKFYNKSEDEDEPDMKCDDMNITCYL 

FHMYVGVRAGGGIGDEIEDPAGDEYELYRW 

FDIITFFFVIVILLAnQGUIDAFGELRDQQEQV 

KEDMETKOTCGIGSDYFDTTPHGFETHTLEE 

HNLANWvdFFLMYLINKDETEHTGQESYVWK 

MYQERCWDEFPAGDCFRKQYHMJLS 


501


1851


A 


3869 


467 


663 


VIVAIYCQUFDKGAK'nQ*PFQQIAL/CKRMK 
LGPC3TPCGKINSEWIRELSVRVKTIKHLEIGV 
N 


502


1852


A 


3888 


1042 


724 


SOvIQWRDLTPLQPLPPRFKQFSCLSLPGSWD 
YRHAPVPLLTNF\*FLVEMGFCYVGQAGRKLL 
ASSDQSALASQSAGITGlSrAPGPPFFFLNFEA 
GSCSVAQAGVQ 


503 


1853


A 


3891 


1773 


1193 


EVDSQSGVQ»QAPGSLQLQTPGLK/VSCXLSR 
QDYRSSIfm^CCyYYYYY/VFL'RRGLTTL 
VQGGLKLLPSSNPFASAP*TAGITGMSHCAGP 
HFNr*MFKiaSCIRE»F»HTRm>IPFLlLFFKET 
WVLLCYPGWPQIPGLKPSSCLRLLSSWDHRC 
APPCPASFFIFHVDRVSPPCPGLVSnTKMLLL 
L 


504


1854


B 


3896 


279 


70 


KfVSKSKSILMSYNHVELTFSDMKKMPEAFRR 

TQKHITYLIPYQVIFWSTGKDAMRSFMMPFY 

QKEYYENQ* 


505 


1855


A 


3899 


2 


1396 


EPGVPTKKTWFDKPDFNRTNSPGFQKKVQFG 

NElfnCLELRKVPPELNNISKLNEHFSRFGTLV 

NLQVAYNGDPEGALIQFATYEEAKKAISSTEA 

VLNNRFIKVYWHREGSTQQLQTTSPKVMQPL 

VQQPILPWKQSVKERLGPVPSSnEPAEAQS 

ASSDLPQVLST\LLA*QKQCnQLL/WKAAQKT 

LLVSTSAVDNNEAQKKKQEALKLQQDVRKR 

KQEILEKHIETQKMLISKLEKNKTMKSEDKAE 

IMKTLEVLTKNTTKLKDEVKAASPGRCLPKSl 

KTKTQMQKEIXDTELX)L"YKKMQAGEEVEEL 

RRKYTELQLEAAKRGILSSGRGRGIHSRGRGA 

VHGRGRGRGRGRGVPGHAWDHRPRALEIS 

AtlKSDKliDLLPHFAQYGEIEDCQEDDSSLHA 
VTTFKTRAEAEAAAVHGARFKGQDLKLAWN 
KPVTNISAVET^VEPDEEEQREinA 


506


1856 


A 


3911 


1952 


919 


DAELSGTLSLVLTQCCKRDCDTVQKLASDHK 

DlHSSVSRVOKAIDKNFDSDISSVGlDGCWQA 

DSQRLLNEVMVEHFFRQ(»4LDVAEELCQES 

GLSVDPSQKEPFVELNRILEALKVRVLRPALE 

WAVSNREMLLAQNSSLEFKLHRLYFISLLMG 

GTTNQREALQYAKNFQPFALNHQKDIQVLM 

GSLVYLRQGENSPYVHLLDANQWADICDIFT 

RDACALLGLSVESPLSVSFSAGCVALPAUNDC 

AVIEQRQCTGVWNQICDEU>IEWDLG»KSAGY 

HSIFACPILRQQTTDNNPPMKLVCGHIISRDAL 

NKMFNGSKLKCPYCPMEQSPGDAKQIFF 


507 


1857 


A 


3936 


439 


18 


SHPFSPAPGICPDAPPPLPRPSKGLGHPGTAGA 

PGSGARCHPPSTCSPSWASPG*GAKASPALPR 

SHGVTLLCKAQAHLCRGEDSKDASGSTSQA 

WEPG*GAWGMPRCQGPALGSCFCPPGTTVQ 

RPAKQRDKRNRHLOl 


508


1858 


A 


3944 


120 


412 


WCPAGTLDFPGPQEMVLLEIEVMNQLNHRNL 
lQLYAAIETPHErVLFME\YECPK*W*GLGGGT 
TRHGASRpGVCAHSIEGGELFERrVDEDYHLT 
EV 


509 


1859 


A 


3949 


31 


392 


LTKTPSPREKGRGVLSVLLMMI*KCRVIFVKIP 
MVFFLQNFC/RnLNVA\WTQD*PNTL*KEQRG 
1TFSDSKS*YKATKIKTMWYCHKNRYID/EKN 
RIEIPEINPaCDKOFRKLSMTrQ 


510 


1860 


A 


3954 


1013 


885 


FSETOACCPRLEHSGRIEAHCSLNIPGSSDPPT 
SASSVAATTG 


511 


1861 


A 


3956 


1 


10S4 


PPAWAPRSPLIWAPTSGRHPCRAALPWSTSSV 

RWQPSEKQPPPPAHRGPADSLSTAAGAAELS 

AEGAGKSRGSGEQDWVNRPKTVRDTIXALH 

QHGHSGPFESKFKKEPALTAVARTARKRKPS 

PEPEGEVGPPK\TreRPSRGCPHPQRGSRSP*L 

LHPLLCIJIHHPLPHLIPTGPHRLKRPRM\P\SP 

MAALILVADNAGOSHASKDANQVHSrrRRN 

SNSPPSPSSMNQRRLGPREVGGQGAGNTGGL 

EPVHPASLPDSSLATSAPLCCTLCHERLEDTH 

FVQCPSVPSHKFCFPCSRQSIKQQGASGEVYC 

PSGEKCPLVGSNVPWAFMQGEIATILAGDVK 

VKKERDS 


512 


1862 


A 


3957 


1086 


3 


QDRARLDCSSATSAHCNLRLPGS*DSPASASR 

VAGTTDTHHHTWLILGSSVQTGFDHVGQAG 

LELLTSGDPPISASESAGIMGMSHCVWP*SWG 

LSHHMAPPQGDGGRARGTPGPEQSFWNLSC 

H*HlCQVPS*IAfrQL/FWGRHQYNPTMKRGK 

LRHREACSLPIJPGEGEP(aLQPSS\*SQNPCSSPL 

FHHQL*AWLWCPELLLQOQARRH*RSPPS/FK 

CPATLSLTAWSQTKIILRSQELXJUPWL*RAL*H 

PP\CHWPSRRSLGDPIXPRSQG*RDGT*ASTFC 

SYF*DTESHLVAQAGVQWRDLGSLQPPCPRL 

K.\Kr i>KL&rl'obY IHKYVrSHLAESCISSRDRIP 

PSRPDRSRNSNSLSR 


513 


1863 


A 


3961 


3038 


476 


VALTTSMCCNKQVrVlDKIKSASIADRCXSALH 

VGDHn-SroGTSMBYCTLAEATQFLANTTDQ 

VKLEILPHHQTRLALKGPDHVKIQRSDRQLT 

WDSWASNHSSLHTNHHYNTYHPDHCRVPAL 

TFPKAPPPNSPPALVSSSFSPTSMSAYSLSSLN 

MOTLPRSLYSTSPRGTlylMRRRLKKKDFXSSL 
nucleotide 
corresponding 
to first 
amino acid 
residue of 
peptide 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














SLASSTVGLAGQWHTETTEWLTADFVTGF 

GIQLQGSVFATETLSSPPLISYIEADSPAERCG 

VLQIGDRVMAINGIPTEDSTFEEASQLUU>SSI 

TSKVTLEIEFDVAESVIPSSGTFHVKLPKKHN 

VELOmSSPSSRKPGDPLVISDIKKGSVAHRT 

GTUILQDKLLAIDNIRLDNCSMEDAVQILQQC 

EDLVKLKIRKDEDNSDEQESSOAiryrVELKR 

YGGPLG\rnSGTEEP\FDL«IISSLTICG(H-AERT 

GAIH1GDRIL\AINSSSLKGKPLSEAIHLLQMAG 

ETNTTUCIKKQTDAQSASSPKKFPISSHLSDLGD 

VEEDSSPAQKPGKLSDMYPSHGCPSVDSAVD 

SWDGSAVIDTS\YGTEOT\SFQASGY\NFNTYD 

WRSPKQRGS\LSPVT\KPRSQTYPDVGLSYED 

WDRSTASGFAGAAUJSAETEQEENFWSQALE 

DLJETCXjQSGILRELEATIMSGSTMSLNHEAPT 

PRSPAGSDRPSFQERSSSRPETYSQrrRSNTLPS 

DVGRKSVTLRKMKQEIKEIMSPTPVELHKVT 

LYipSDMEDFOFSVADGLLEKOVYVKNERPA 

GPGDLGGLKPYDELLQVNHVRTRDFDCCLV 

VPLIAESCTOCLDLVISRNPLASQKSIDQQSLPG 

D'SEQNSAFFQQPSHGGNLETREPTNTL 


514 


1864 


A 


3967 


833 


860 


LEK<yjVS(»lATKRLARQLGLIRKKSIAPANG 

NLGRSKSKQLFDYUVIDFESTCWNDGKHHH 

SQEIIEFPAVLLNTSTGQIDSEFQAYVQPQEHPI 

LSEFCMELTGIKQAQVDEGVPLKICXSQFCK 

WlHKIQQQKNUFATGlSEPS/DF'SKIMaCYL 

VR*RISYTy*SKHKSKGC 


515 


1865 


A 


3969 


492 


182 


CRFWGlSTHCDTCDPLSPQnTO**EGDLWSL 
DLLGPEFLARKPLFKTKTYQSTF»SISKNE/FTC 
PNFIIEEGTDLIF\*QVKHNPCHRLTPEEGTVQL 
NRADS 


516 


1866 


A 


3977 


2 


1357 


KMIXyQKESNYIRLKRAKMDKSMFVKIKTLGI 

GAFGEVCLABKVDTKALYATICrLRKKDVIi 

RNQVAHVKAERDILAEADNEWWRLYYSFQ 

DKDNLYFVMDYIPGGDMMSLLIRMGIFPESL 

ARFYIAELTCAVESVHKMGFIHRDIKPDNILID 

RDGHnaTDFGLCTGFRWTHDSKYYQSGDHP 

RQDSMDFSNEWGDPSSCRCGDRLKPLERRAA 

RQHQRCIAHSLVGTPNYIAPEVLLRTGYTQL 

CDWWSVGVILFEMLVGQPPELAQTPLETQM 

KVDWQTSLHIHPQAKLSPEASDLIIKLaflGPE 

DRLGKNGADEIKAHPIF*NQFDFSQ*PH)SRS 

ArKQrr^Nxil Ir lUlaNrlJl^VJL>rJJKX.WSDDN 

EEENVNDTLNGWYKNGKHPEHAFYEFTFRRF 

FDDNGYPYNYPKPIByEYlNSQOSEQQSDEDD 

QNTGSEIKNRDLVYV 


517 


1867 


A 




1 3 JO 


lUZZ 


WUWmY\^1 fail I T7G1/d/l DC/ *UUUUUH\n Ul ' \F 

VFLDRVPLCHPGWSAWQSQVT/VNLPPSWD 
♦RCRPPH/LANLCNFCRDVSFTTLPRLVLNTWA 
QAIFQPQPPKVLGLQV 


518 


1868 


A 


3986 


974 


666 


SPe^HPrrQAOVQWHHLSSLQPLPPGFK*F 
SCFSLPE*LGYRHVPPCLANSVFSVEMQ\FLH 
VGQAGLEU,TSGDLPAIASQSAGITG\SHRAR 
PENGFENIF 


519 


1869 


A 


3994 


751 


126 


NQGLRHVGLCRTCLVNQMFASSILGKSHHHS 

USINQGHNALWKAAG\PLPLKAGYC\QSFSPC 

DSLKYGVSWDEKDLTVPQRDTHKRSVLRWIS 

QRGKVLAVEMEEGHCLLVLPLGTBCLGIKVPIV 

HLFSSEMGEVNRPMVG\ARHVYSNAALLSPTP 
nucleotide 
location 
to last amino 
acid residue 
of peptide 
sequence 


Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














LRCLGGEKHKSGLHARPVIVPSLELHYDMDSI 
AHWADLLLIULPSYYIPFC 


520 


1870 


A 

n. 






oyo 


Qorlu^LJ^obWIJYRHM*PRl.ANF*TAFFCIU)R/ 
SLALLPRLVSNSWPQAE-PPRPPKVLGLQT 


521 


1871 


A 


4011 


1346 


1178 


FFF*tTVSCS/^*AGVK.SHDNSSLQPPSPG\SSN 
PPTSASHVAGATGTHHHAWLLSV 






A 


4015 


2 


377 


QGIALLHIMGESVKHVTGGYKLRTRPLEFAA 
lODYLDTFALKLGTIDRIAQRIIKEEIEYLVELR 
EYGPVYSTWSALEGELAEPLBGVSACIGNCST 
AL'ELTDDMTEDFLFVLREYILYSDSMK 


523 


1873 


A 

A 


4018 


341 


19 


ERVmNQlQQAQRSPHU-NAKKiJS/PRPNIVELP 
KATCEVCKTSKS/GQVIYKGVSIRLRANFLAEP 
L*KRREWDEAIKVLKEKQVFLSKMVYPANLSF 
GNEGDTTSFPAK 


524 


1874 


A 


4020 


1067 


743 


FWXWSIVDSVAQAGVKWCNLGSLQAPPPGF 
TPFSCLSli»SSWDYRHPPPRLAN*LTNFLCF** 
RQGFTVLARMVLIS*PHDLPASASQSAGrrGL 
SHCSWPTSSILS 




1875 


A 


4021 


781 


351 


QFRVIFFFLRRSHSVAQAGMQWHDHSLLQPL 

PPRLKQ/F/SHLSPPSIWDYRRVPPCLVNFSIFF 

VETGSCQPCLQLLGSSNPPASASQSAGIAGISH 

QGQPE*SFDIRFACVIAALRETFQCLCSASRVN 

NKIINRPTHPVESSF 


526 


1876 


A 


4024 


80 


341 


TPSSTSRGTEEQQSSKMAWQRREEKEHLNYR 
RSSAEDGWKADKP/VDG*TPGEDHLPTPSPFQ 
LHIHSSESQLHHSVKSPPSLSFRLM 


527 


1877 


A 


4026 


593 


230 


DFYLYPERKKRGQMMTAVSLTTRPQESVAFE 
DVAVYFTTKEWAIMGNPAERALYRDVMLEN 
YOGCGPL*CHPTSKPALVFS\LEQGKESCFSPA 
TGSSLSRNDWRAOWIGYLELRRYTYLS 


528 


1878 


A 


4028 


1160 


242 


GTSELLCIQRWNWGPAFPPRPGLALAPTLQLL 

VEMGSAKSVPVTPARPPPHNKHLARVADPRS 

PSAGILRTPIQVESSPQPGLPAGEQLEGLKHAQ 

DSDPRSFTLXjIARTPMKTSSGDPPSPLVKQLSE 

VFETEDSKSNLPPEPVLPPEAPLSSELDLPLGT 

QLSVEEQMFPWNQTEFPSKQVFSKEEARQPT 

ETPVASQSSDKPSRDPE1TRSS\GSMRNRWKP\ 

NSSKVL\GKSPLHPSCQDDNSPGTLTLRQGKA 

AFKPLSENVSELK\EGA\ILGTGR\LLKTEGRA 

WEQGQDNHDKENQHFPLVES 






A 


4039 


2 


366 


KDMVIJMEMQSMlTMK.CPQYL*E*RKIPDrrK 

cw*gcgstgilifc/ws*pl*kti»qpr*fkqi;*t 

ILTIIYSlM»EHTFHNAGV*LSDrYPRFKaCGYV 

hteicpmfiavlfvwktwkqf 


530 


1880 


A 

n 






J 


LLBVNGN'IIVI'VkI'KAQNKKNKGSRSILFKQL 
RKYGSRIN1IXSKHDKN1CIENYKT*MKEIEA 
/DTOKWKDILCSWIRRIMvIKDILCSWIGRraV 
VKlSILPKVNYRFYLISIKnMAI 




1881 


A 

A 






278 


tqgteeiykisscewvqasfstplitlhdfkiy 
hkatvikmvwywhrq*kfskn/riesseieph 
lydqfifdkgeknqekgnsffnn/mcwknwif 

T*KR 


532 


1882 


A 


4069 


19 


368 


ndllenfkfwe*fkb*leningtvteketggv 
ykelsspkysgtrqfygqnsnfpgknsmvy 
klfqnte/tegrhpislyefritlrnpnkdniyl 
qiwn4pvslmnivtixcpt 


533 


1883 


A 


4076 


1 


355 


PIRKI^KVAG»KS^m'K»LAELHINNliQ^^lNKI/ 
mn/PFIIASKIUKYSGISLTKEMKDLYTETLLR 
KKEDTNKWKDI/SCFWVGR/LNIVKMPK/VIC 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














IFNAIPIKMPMMCMAKIEKNSS 


534 


1884 


A 


4088 


3 


1931 


lEDSSTRRMESERSPLYRQUDLGYLSSSHWNC 

GAPGQDTKAQSMLVEQSEKLRHLSTFSHQVL 

QTRLVDAAKALNLVHCHCLDIFINQAFDMQR 

DLQrrPKiaJBYTRKKENELYESLMNIANRKQE 

EMKDMIVEI^NTMKEEUJDDATN^IEFKDVI 

VPENGEPVGTREKCaRQIQELnSRLNQAVA 

NKUSSVDYLRESFVGTLERCLQSLEKSQDVS 

VHTTSNYLKQILNAAYHVEVTFHSOSSVTRM 

LWEQnCQnQRTrWVSPPAlTLEWKRKVAQEAI 

ESLSASKLAKSICSQFRTRLNSSHEAFAASLRQ 

LEAGHSGRLEKTEDLWLRVRKDHAPRLARLS 

LESRSLQDVLLHRKPKLGQELGRGQYGWYL 

CDNWGGHFPCALKSWPPDEKHWNDLALEF 

HYMRSLPKHERLVDLHGSVIDYNYOGGSSIA 

VLLIMERLHRDLYTGLKAOLTLETRLQIALDV 

VEGIRnJISQGL\TIRDIKIJCNVl-LDKQNRAKI 

TOLGFCKPEAMMSGSIVGTPIHMAPELFTGK 

YDNSVDVYAFGIIJ'WYICSGSVKLPEAFERCA 

SKDHLWNNVRRGARPERLPVFDEECWQLME 

ACWDGDPLKRPLLGIVQPMLQOIMNRLCKSV 

NSfiQrNROUDDST 


535 


1885 


A 


4090 


2 


417 


ALMPHEANYEEIFLKTDKDMDGFESGLEVRE 
tFLKra/GLPSTLLAHIWALCDSKDCGKLSKD 

ASLQKVreLlTUCPVCnnCGmWRTTDSIWMK 

XTXTDVoniTT O A 

iiriKJuuWLJ\A 


536 


1886 


A 


4102 


569 


829 


DHQK^KNIPCSWIGJUNIVKMSILPKAIYRFSAI 
PIKIPNm?FTm*S»NVTOTTKTQE*AKAILSKK 


537 


1887 


A 


4104 


54 


281 


SIDCEHURRMLVLDPSKRLTIAQKEHKWML 
IEVFVQRPVLYPQEQH^SIGEFNEQVLRLM 
HSLGIDQQKTIB 


538 


1888 


A 


4109 


141 


314 


nanPUORSWSHLKCFYKFILTFFFAGCSQPL 
VPRENTTAWMNAIGLIITALPVS 


539 


1889 


A 


4111 


268 


1 


ASRPWGHSYP*FNQQEVDTLKRPIASSEI*MM 
I^KFATVKKSPGPYRFTAEFSHTFKEDLVPILW 
PLFPlOYREGTLPHSFYEASrrL 


540 


1890 


A 


4142 


198 


2064 


PEPGAGRAATPWGPLFWRGRGSOEICEKAAE 

AALGDFLGLHRRTQQPAVDRLLSDASAQWR 

VRGHGGVRESQRAPQQPGRRRORRPRKRPR 

ORWRREOCOAOOROVCVAAWSQRSIAONN 

DYRLFHKMSNSHPLRPPTAVGEIDHVHILSEH 

IGALLIGEEYGDVTFWEKKRFPAHRVILAAR 

CQYFRALLYGOMRESQPEAEIPLQDTrAEAFT 

MLLKYIYTGRATLTOEICffiVLLDFLSLAHKY 

OFPEUBDSrT^YIXTILNIQNVCMrFDVASLY 

SLPKLTCMCCMFMDRNAQEVLSSEGFLSLSK . 

TALLNIVUU5SFAAPEBCD1FLALLNWCKHNSK 

cNxiAuMQA VKJUrLMal.1 cLLN V VKi'aOXJ.br 

DAILDAIKVRSE55RDMDLNYRGMLIPFFNIAT 

MKYGAQWKGELKSALU3GDTQNYDLDHG 

FSRHPIDDDCRSGIEIKLGQPSnNHVRILLWDR 

DSRSYSYnEVSMDELDWVRVIDHSQYLCRS 

WQiaYFPARVCXYlRIVGTHNTVNKIFHrVAF 

ECMFTNKTFTLEKGLIVPMENVATIADCASVI 

EGVSRSRNALLNGDTKNYDWDSGYTCHQLG 

SGAIWQLAQPYMIGSIRVIXWDCDDRSY 


541 


1891 


A 


4146 


282 


778 


GTLGYPNOARGQPQDNFFAHQWSHHPPISAC 
Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














HAESENFAFWQDMKWKNKFWGKSLETVPVG 
TVNVSLPRFGDHFEWNKVTSCIHNVLSGQRW 
IEHYGEVLIRNTQDSSCHCKITFCKAKYWSSN 
VHEVQOAVLSRSGRVLHRLFGKWHEGLYRG 
PIPGGQCIWKP 


542 


1892 


A 


4147 


44 


433 


SVDAYV(m)lV>SYR'rnTLLKGA*LTHRYVA 

QDPKQGQLRSLHLTCDSAPAGSQGTWSTSCR 

IhnnJFRGGAQITFLATFDDSPKAVLODRLLLT 

ANVSSENNITOTSKTTFQUELSVKDAVYTW 

SSH 


543 


1893 


A 


4153 


678 


11 


TISYPQCLTQNlYFLISFANVDTFliPIMALDH 

YVAICSALQ*CSinT/ELCQGLPVLA*AGSSUS 

PVHTVIMSRLAFCSSAQISHFYRDAYLLMKIA 

CSHT*VNQHVFLQAVVlJLAPCAliLVSYIRIA 

AAIIJRIPSPTRIUIKACSICSSHLSLVTLFYGTV 

LGia*PPDSFSAQDAIATIMYTVVTSlS4LNPFIY 

SLMNKEVQEAVRRLFSRGSHSSWCW 


544 


1894 


A 


4158 


3 


538 


LLYAQAGVQ'LNLSSLQPQPAGUCQSSHPSLP 

SSWDYRYSTPHPANFFVHvlEFHHVAQAGLEL 

LGSGDLPTSTSHSAOITGVXSHHAPPRLISSEGS 

LLGHLLCLPMVFPLLCVFVUSSSLAGEEAAG 

LRVQKLWPAWLSHLPVCWFHCSGIWSEVIE 

LKVGREGHVLPWQAHWBF 


545 


1895 


A 


4160 


1 


412 


HPLGLGLVPSEIFSPQDKKAADGSILAPARGE 

DLEAGLKGSFMDGRLQASVSVFRIQRVGSAM 

QDTASAMPCLPYYPTSHCFMAOGKSRSQGW 

ELELSGEPAPGWQVLAGYTVTQARYUUDASE 

ANVGQPLRPVDPR 


546 


1896 


A 


4174 


1252 


1190 


FFQVFIFLFLIFFKTEFHSCCPQAVQWHDLDSL 
QPPPPRFKGFSC5LSLPSSWDYRHAPAHPANFV 
FLVETGFLHV\GQVASLELPTSODTPAS\ASQSA 
GITGVSHHA*PRASGRRCW 


547 


1897 


A 


4176 


3029 


1 


AGPDGLAAPASCQJGARGQTRVPGAFSWLAP 

GSHHASEGLAPGVPPAGGVSAQELTAPPQEG 

WGLGAPPAAPRPESDEKRAGSDAVRSFSRGA 

RDSLGQRRLGGTRGAOPAGKQAQRTMGPAS 

GFHSFPPRPHQEPSPRSSCWQHLLWHCPWPQ 

PSKLFRLTPAQLLQGPGVLAAPPGP»HVPGFL 

AQSPWPLPSGPRSP*DPLHQGAL,VPUQGGSP 

HTAPHCLPSVLSPAIQQPLUTAST/SSRSPPAS 

TMAPIPSALAVWEPAGSSPQLSSAPADSSVPLP 

ALnCVLPPWTQKPLLOCLCXJSPLPLLSPPDQV 

RCPPACSPAAASSFSFESQPCPSAPSKASPAPA 

ALVIVGPHHPP*SQQPQSQSVHPHGPGGPQPPL 

AASSLFWMFCQPPPPHPQFLWHRPLPVTGKA 

LASVPLCFRPAPGSLRQTPLPPQFHIPRPGLSAP/ 

PPPASGTSDSSDSRSPSASAARVWPPAVSPPPP 

AARHRPHPPEYFLSPCPFSCGFPRLLORPRRPQ 

ALQTPRAWDLPPGSSPAPLCSGPELP*APPPLP 

PFPRVA*LGSGHPPSAQVPGLW*RCV*GHPIP 

RPVGHS*SGPPHSPPL*APPQAWPLELPPSRQC 

LVJ'LHLRAAQPLDPCCSLSPPGPPLPVPALPS 

WPGRP'SPSPASSQPPVHAOLPOPQSSPLPPGL 

PQLPSLRSQSQQPLLFFQCPaPOAVWGKGSPQ 

PLSPHPPPP/ARTQTFFVASRSLSPGTAPYSVCL 

TPSRSASSLPEWLASSLPKIPQSSGSVPLGPTSP 

MP*CF1IRPSPPLP/LSSPFPA\LRPQAPQFPLHLP 

P*PPAPSPGCPLPPLAQQHQPSPPSPHARSTLT 

PPLWPSLALLP*PLPPPPPVPSFSASLLCSLPAH 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














GTPASPGLGRSCLGKPQTLPWISFWPPSGRLA 
PGTWQPW/PVSPAPLSCLSAWDPWELPSPQPQ 
VCSTABLPTSCLLSSPGPVPAFQPPRFGCL*GPP 
GPPGLPPLQSSLSFPPPH'PVPQPPAPPALQWG 
LHLPOGRTK 


548 


1898 


A 


4180 


2369 


844 


RIHREEDFQFILKGURLLSNPIXQTYIJPNSTK 

KIQFHQELLVLFWKLCDFNKVGQPRGALQGD 

GEQLPQ*PGGRDSVRLRGVGQSCPSLELSPLG 

PSPHP*KFLFFVLKSSDVLDILVPILFFLNDAR 

ADQSRVOLMHIOVFILLLLSOECNFGVRLNKP 

YSIRVPMDIPVFTGTHADLLIVWFHKIITSGHQ 

RLQPLFDCLLTIVVNVSPYLKSLSMVTANKLL 

HIXEAFSTrWFLFSAAQNHHLVFFLLEVFNNI 

IQYQFDOJSNLVYAIIRKRSIFHQLANLPTOPP 

TIHKALQRRRRTPEPLSRTGSQGGAPPWRAPA 

PLPLQSQAPSRPVWWLLQALTS*PRSPRCXJR 

MAPCGPWNLSPSRAWRMAARLRGSPARHGG 

SSGDRP/HSSASGQWSPTPEWVLSWKSKLPLQ 

miRLLQVLVPQVEKICIDKGLTDESEDLRFLQ 

HGTLVGLLPVPHPnjRKYQANSGTAMWFRT 

YMWGVmjlNVDPPVWYDTDVKLFEIQRV 


549 


1899 


A 


4191 


858 


321 


LPWQRLGVLLSRGKMAVTGWLESLRTAQKT 

ALLQDGRRKVHYLPPDGKEMAEEYDEKTSE 

LLVRKWRVKSALGAMGQWQLEVGDPAPLG 

AGNLGPELKESNANPIFMRKDTKMSFQWRIR 

NIJ»yPKDVYSVSVDQKERai\TnTNKKyYK 

KFSIPDLDRHQLPLDDALLSFANTPTAP 


550 


1900 


A 


4192 


1 


1980 


KHTGSDIAGVCGWLLLSQPCGVGLDLDSRLL 

GASAMRRSEVLAEESIVCLQKALNHLREIWE 

UGIPEDQRLQRTEVVKKHIKEU,DMM1AEEE 

SLKERLKSISVCQKELNTLCSELHVBPFQEEG 

ETmQLEKDLRTQVELMRKQKKERKQE\LKL 

LQEQDQELCVEILCMPHYDIDSASVPSLEELNQ 

FRQHVTTLRETKASRREEF/VSSKRQIILCME 

ElJ3HTPDrSFERDVVCEDEDAFCI,SLENIAT\L 

QKLLRQ\LEMQKSQNEAVCEG\LRTQIVRELW 

DRLQIPEEEREAVATIMSGSKAKVRKXALQVLE 

VDRLEELEKCKTMKKVIEAIRVELVQYWDQC 

FYSQEQRQAFAPFCAEDYTESLLQLHDAEIVR 

LKNYYEVHKELFEGVQKWEETWRLFLEFER 

lO^SDPNRFThKGGNLLKEEKQRAKLQKMLP 

KLEEELKARIELWEQEHSKAFMVNGQKFME 

YVAEQWEMHRLEKERAKQERQLKNKKQTET 

EMLYGSAPRTTSKRRGLAPNTPGKARKLNTT 

TMSNATANSSIRPIFGOTVYHSPVSRLPPSGSK 

PVAASTCSGKKTPRTGRHQANJaBNLELNGSI 

LSGGYPGSAPLQRNFSINSVASTYSEFADPSLS 

DSSTVGLQRELSKASKSDATSGILNSTNIQS 


551 


1901 


A 


4194 


3 


1008 


AWHEGLVSSPAIGAYLSASYGDSLWLVATV 

VALLDICFILVAVPESLPEKMRPVSWQAQISW 

KQADPFASLKKVGKDSTVLLMCrrVCLSYLPE 

JurXi X oorr \L# I JUK\f^ v Hjr G\ i v JnJAAtiAMVGI 

LSIVAQTAFLSILMRSLGNKNTVLLGLGFQ^IL 

QLAWYGFGSQAWMN4WAAGTVAAMSSrrFP 

AISALVSRNAESDQQGVAQGlITGnwaCNGL 

OPALYGFIFYMFHVEL'nBLGPKLNSNNVPLQ 

GAVIPGPPFLFGACIVLMSFLVALFIPEYSKAS 

GVQKHSNSSSGSLTNTreRGSDEDIEPLLQDS 

SIWELSSFEEPGNQCTEL 
SEQ ID NO: of nucleotide sequence 


SEQ ID NO: of peptide sequence 


Method 


SEQ ID NO: in USSN 09/496914 


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence 


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence 


Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


552 


1902 


A 


4197 


2 


14302 


ARPPPAPGSRQQKQKAAPGAAAAAELRGAR 

EPAPARRRGTMADGGEGEDEIQFLIRTDDEVV 

IXJCTATIHKEQQKLCLAAEGFGNRLCFLESTS 

NSKNVPPDLSICIFVLEQSLSVRALQEMLANT 

VEKSEGQVDVEKWKFMMKTAQGGGHRllX 

YGHAILLRHSYSGMYLCCLSTSRSSTDKLAFD 

VGLQEDTTGEACWWITHPASKQRSEGEKVR 

VGMDLILVSVSSERYLHLSYGNGSLHVDAAF 

QQTLWSVAPISSGSEAAQGYLIGGDVLRLLH 

GHMDECLTVPSGEHGEEQRRTVHYEOGAVS 

VHARSLWRLETLRVAWSOSHIRWGQPFRLR 

HVTTGKYI^IAlEDiaajLLMDKEKADVKSTA 

FTFRSSKEKLDVGVRKEVDGMGTSEKYGDS 

VCYIQHVDTGLWLTYQSVDVKSVRMGSIQR 

KAIMHHEGHMDDGISLSRSQHEESRTARVIRS 

TVFIJNIUTRGUJALSKKAKASTVDLPIESVSL 

SLQDUGYFHPPDEHLEHEDKQNRLRALKNR 

QNLFQEEGMINLVLECIDRLHVYSSAAHFAD 

VAGREAGESWKSILNSLYELLAAURGNRKN 

CAQFSGSLDWUSRLERLEASSOILEVLHCVL 

VESPEALNIIKEGHIKSnSLLDKHGRNHKVLD 

VLCSLCVCHGVAVRSNQHUCDNLLPORDLL 

LQTRLVNHVSSMRPNIFLGVSEOSAQYKKWY 

YELMVDHTEPFVTAEATHLRVGWASTEGYSP 

YPGGGEEWGGNGVGDDLFSYGFDGLHLWSG 

CIARTVSSPNQHLLRTDDVISCCLDLSAFSISF 

RINGQPVQGMFENFNIDGLFFPWSFSAGIKV 

RFLLGGRHGEFKFLPPPGYAPCYEAVLPKEKL 

KVEHSREYKQERTYTRDLLOPTVSLTQAAFT 

PIPVDTSQrVLPPHLERIREKLAENIHELWVMN 

KIELGWQYGPVRDDNKRQHPCLVEFSKLPEQ 

ERNYNLQMSLETLKTLLALGCHVGISDEHAE 

DKVKKMKLPKNYQLTSGYKPAPMDLSFKLT 

PSQEAMVDKLAENAHNVWARDRIRQGWrY 

GIQQDVKNRRhffRLVPYTPLDDR"EKKSNKDS 

lreavrtlloygynleapdqdhaaraevcs 

gtgerfrifraektyavkagrwyfefetvta 

gdmrvgwsrpgcqpdqelgsderafafdgf 

kaqrwhqgnehygrswqagdwgcmvdm 

neh™mftlngeillddsgselafkdfdvgd 

gfipvcslgvaqvormnfgkdystlkyrnc 

glqegyepfavntnrditmwlskrlpqflqv 

PSNHEHmVTRIDGTIDSSPCLKVTQKSFGSQN 

sntdimfyrlsmpiecaevfsktvagglpgag 

lfgpkndledydadsdrevlmktahghlvp 

drvdkdkeatkpefnnhkdyaqekpsrucq 

rfujoitkpdystshsarltedvladdrddy 

dflmqtstyyysvrifpgqepanvwvgwrrs 

dfhqydtgfdldrvrtvtvtlgdekgkvhe 

sdcrsncymvcagesmspgqgrnnngleigc 

wdaasglltfiangkelstyyqvepstklfp 

awaqatspnwqfelgriknvmplsaqlfks 

ehknpvfqcpprlhvqflshvlwsrmpnqfl 

kvdvsriserqgwlvqcldplqfmslhipeen 

rsvdilelteqeellkfhyhtlrlysavcalg 

nhrvahalcshvdepqllyaienkympgllr 

AGYYDIXroiHI^SYATARLMMNNEYrVPMT 
EETKSrUFPDENKKHOLPGIGLSTSLRPRMQF 
SSPSFVSISNECYQYSPEFPLDILKSICnQMLTE 
AVKEGSLHARDPVGGTTEFLFVPLIKLFYTLU 
PCT/US01/03800 



SEQ ID NO: of nucleotide sequence 


SEQ ID NO: of peptide sequence 


Method 


SEQ ID NO: in USSN 09/496914 


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence 


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence 


Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














MGIFHNEDLKHILQLIEPSVFKEAATPEEESDT 

LEKELSVDDAKLQOAGBEEAKGGKRPKEGLL 

QMK1J»EPVKIX2MCLLLQYLCDCQVRHIUEAI 

VAFSDDFVAKLQDNQRFRYNEVMQALNMSA 

ALTARKTKEFRSPPQEQINMLLNFKDDKSECP 

CPEEIRDQUXIFHEDLMrHCOlELDEDGSLDG 

NSDLTIRGRLLSLVEKVTYLKKKQAEKPVES 

DSICKSSTLQQLISETMVRWAQESVIEDPELVR 

AMFVLLHRQYDGIG(a,VRALPKTYTINGVSV 

mnNlLASLGQIRSlXSVRMGKEEEKLMIRG 

LGDIMNNKVFYQHPNUvIKAIXJMHErVMEV 

MVNVLGGGESKEITFPKMVANCCRFLCYFCR 

ISRQNQKAMFDHLSYLLENSSVGLASPAMRG 

STTLDVAAASVMDNNELALALREPDLEKWR 

YLAGCGLQSCQMLVSKGYFDIOWNPVEOER 

YLDFLRFAVFCNGESVEENANVWRLLIRRPE 

CFGPAUIGBGGNGLLAAMEEAIKIAEDPSRD 

GPSPNSGSSKTLDTEEEEDDTIHMGNAIMIPY 

SALIDLLGRCAFEMHLIHAGKGEAIRIRSILRS 

LIPLGDLVGVISIAFQMPTIAKDGNWEPDMS 

AGFCPDHKAAMVLFLDRVYGIEVQDFLLHLL 

EVOFtPDLRAAASLDTAALSATDMALALNRY 

LCTAVmJLTRCAPLFAGTEHHASLIDSLLHT 

VYRLSKGCSLTKAQRDSIEVCLLSICGQLRPS 

MMQHLLRRLVFDVPLLNEHAKMPLKLLTNH 

YERCWKYYCLPGGWGNFGAASEEELHLSRK 

LFWGDRDALSQKKYEQELFKLALPCLSAVAG 

ALPPDYMESNYVSMMEKQSSMDSEGNFNPQ 

PVDTSNmPEKUEYFINKYAEHSHDKWSMDK 

LANGWIYGEiySDSSKVQPIAlKPYKLLSEKE 

KEIYRWPIKESLKTMLARTMRTERTREGDSM 

ALYNRTRRISQTSQVSVDAAHGYSPRAIDMS 

NVnTLSRDLHAMAEMMAENYHNIWAKKKKM 

ELESKGGGNHPIXVPYDTLTAKEKAKDREKA 

QDILKFLQINOYAVSROEKDLELDTPSIEKRFA 

YSFLQQURYVDEAHQYILEFDGGSRGKGEHF 

PYEQEDCFFAKWLPLIDQYFKNHRLyFLSAA 

SRPLCSGGHASNKEKEMVTSLFCKLGVLVRH 

RISLFGbroATSIVNOJDLGQTLDARTVMKTG 

LESVKSALRAFLDNAAEDLEKTMENLKQGQF 

THlRNQPKGVTQnNYTTVALLPMLSSLFEHI 

GQHQFGEDULEDVQVSCYRILTSLYALGTSK 

SrSfVERQRSALGECLAAFAGAFPVAFLETHLD 

KH^^YS^YNTKSSRERAALSLPTNVEDVCPNIP 

SLEKLMBEIVELAESGIRYTQMPHVMEVILPM 

LCSYMSRWWEHGPENNPERAEMCCTALNSE 

HMhmJXnOTJKIIYNNLGIDEGAWMKRLAVF 

SQPDNKVKPQlXKTHFLPIJ^ffiKLKKKAATVV 

SEEDHLKAEARGDMSEAELULDEFTTLARDL 

YAFYPLURFVDYNRAKWLKEPNPEAEELFR 

MVAEVFIYWSKSHNFKREEQNFWQNEINN 

MSFUTDTKSKMSKAAVSDQERKKMKRKGD 

RYSMQTSLIVAALKRLLPIGLNICAPGDQELIA 

LAKNRFSLKDTEDEVRDDRSNIHLQGKLEDP 

AIRWQMALYKDLPNRTDDTSDPEKTVERVL 

DIANVLFHLEQKSKRVGRRHYCLVEHPQRSK 

KAVWHKLLSKQRKRAWACFRl^lAPLYNLPR 

HRAVNLFLQGYEKSWIETEEHYFEDKLIEDLA 

KPGAEPPEEDEGTKRVDPLHQLtt.LFSRTALT 

EKCXLEEDFLYMAYADIMAKSCHDEEDDDG 
SEQ ID NO: of nucleotide sequence 


SEQ ID NO: of peptide sequence 


Method 


SEQ ID NO: in USSN 09/496914 


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence 


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence 


Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














EEEVKSFEEKEMEKQKLLYQQARLHDROAA 

EMVLQTISASKGETGPMVAATLKLGIAILNGG 

NSTVQQKMLDYLKEKKDVGFFQSLAGLMQS 

CSVLDLNAFERQNKAEGLGMVTEEGSGEKV 

LQDDEFTCDLFRFLQLLCEGHNSDFQNYLRT 

QTONNTTVNinSTVDyLLRVQESISDFYWYY 

SGKDVIDEQGQRNFSKAIQVAKQVFNTLTEYI 

QGPCTCTIQQSLAHSRLWDAWGFLHVFAHM 

Q\IKL^QDSSQIELLKELMDLQKDMVV\1LLS 

MLEGNWNGTIGKQMVDMLVESSNNVEMIL 

KFFDMFLKLKDLTSSDTFKEYDPDGKGVISK 

RDFHKAMESHKHYTQSETEFLLSCAEIDENE 

TLDYEEFVKRFHEPAKDIGFNVAVLLTNLSEH 

MPNDTRLQTFLELAESVLNYKJPFLGRIEIMG 

SAKRIERVYFEISESSRTQWEKPQVKESKRQH 

FDVVNEGGBKEKNIELFVNFCEDTIFEMQLAA 

QISESDLNERSANKEESEKERPEEQGPRMAFF 

Sn.TVRSALFALRYNILTUidRMLSLKSLKKQM 

KKVKKMTVXDMVTAFFSSYWSIFNITLLHFV 

ASVFRGFFRUCSUXGGSLVEGAKKIKVAELL 

ANMPDPTQDEVRGDGEEGERKPLEAALPSED 

LTDLKELTEESDLLSDJFGLDLKREGGQYKUP 

HNPNAGLSDLMSNPVPMPEVQEKFQEQKAK 

EEEKEEKEETKSEPEKAEOEDOEKEEKAKED 

KGKQKLRQLHTHRYGEPEVPESAFWKKIIAY 

QQKLU^ryTAR^^nfNMRMLAlJ?VAFAINFILL 

FYKVSTSSWEGKELPTRSSSENAKVTSLDSS 

SHRIIAVHYVLEESSGYMEPTVRILPILHTVISF 

FCHGYYCLKVPLVIFKREKEVARKLEFDGLYI 

TEQPSEDDKGQWDRLVINTQSFPNNYWDKF 

VKRKVMDKYOEFYORDRISELLOMDKAALD 

FSDAREKKKPKKDSSLSAVLNSIDVKYQMW 

KLGWFTDNSFLYLAWYMT 


553 


1903 


A 


4199 


31 


767 


LPELNGRGAGLRRABPSERGGGAERTQQVAA 
LPLSHGHSHGGGGCRCAAER/VGAARGSAAC 
AYGLYLRIDKGRLQCLNESREGSGRGVFKPW 
ERADVDRSKFVESDADEELLFNIPFTG\HVKLK 
GmMGEDDDSHPSEMRLYKNIPQMSFDDTER 
EPDQTFSLNRDLTGELEYATKISRFSNVYHLSI 
mSKhffGADTOCVFYIGLRGEWTELRRHEVTI 
CNYEASANPADHRVHQVTPQTHFIS 


554 


1904 


A 


4200 


1 


961 


GIPCTEMGNFDNANVTOEIEFAIHYCFKTHSL 

EICKACKMAYGEEKKKKCNPYVKTYLLPD 

RSSQGKBKTGVQRNTVDPTFQETLKYQVAPA 

QLVTRQLQVSVWHLGTLARRVFLGBVIIPLAT 

WDFEDSTTQSFRWHPLRAKADKYEDSVPQS 

NGELTVRAXLVLPSRTRKLQEAQEQTDQPSL 

HGQLCLWLGAKNLPVRPDGTLNSFVKGCLT 

LPDQQKLRLKSPVLRKQACPQWKHSFVFSGV 

TPAQLRQSSLELTVWDQALFGMNDRIXGGT\ 

RLGSKGDTAVGGDACSQSJa.QWQKVLSSPN 

LWTDMTLVLH 


555 


1905 


A 


4211 


331 


2419 


KBNKKARNLRMNQSRSRSDGGSEETLPQDH 

NHHENERRWQQERLHREEAYYQFINELNDE 

DYRLMRDHNLLGTPGEITSEELQQRLDGVKE 

QLASQPDLRDGTNYRDSEVPRESSHBDSLUB 

WLNTFRRTGNATRSGQNGNQTWRAVSRTNP 

NNGEFRFSLEIHVNHENROFEIHGEDYTDIPLS 

DSNRDHTANRQQRSTSSPVARRTRSQTSVNFN 

GSSSNIPRTRLASRGQNPAEGSFSTU3RLRNGI 
SEQ ID NO: of nucleotide sequence 


SEQ ID NO: of peptide sequence 


Method 


SEQ ID NO: in USSN 09/496914 


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence 


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence 


Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














GGAAGIPRANASR.TNFSSHTNQSGGSELRQRE 

GQRFGAAHVWENGARSNVTVRNTNQRLEPI 

RLRSTSNSRSRSPIQRQSGTVYHNSQRESRPV 

QQTTRRSVRRRORTRVFLEQDRERERRGTAY 

TPFSNSRLVSRrrVEEGEESSRSSTAVRRHPTrr 

LDLQVR\RIRPGENRDRDSIANRTRSRVGLAE 

NTVTIESNSGGFRRTISRLERSGIRTyVSTrrVP 

LRRISENELVEPSSVALRSILRQIMTGFGELSSL 

MEADSESELQRNGQHLPDMHSELSNLGTDN 

NRSQHREGSSQDRQAQGDSTEMHGENETTQP 

HTRNSDSRGGRQLRNPNNLVETGTLPILRLAH 

FFTXNESDDDDRIRGLTKEQIDNLSTRHYEHN 

SIDSELGKICSVaSDYVTGNKLRQLPCMHEF 

HfflODRWLSENCrCWCRQPVLGSNIANNG 


556 


1906 


A 


4212 


3 


462 


LQRQRQHPAAAPAVPVRCFTFCFTDIVIMPKR 

KSPENTEGKDGSKVTKQEPTRRSARLSAKPA 

PPKPEPKPRKTSAKKEPGAKISRGAKGKKEEK 

QEAGKEGTAPSENGETKAEEIHISRSTVNVST 

SRGTPPSTLSVKGQIETVRVKGTEN 


557 


1907 


A 


4213 


774 


507 


ARRFSCLTLQTSWGHRHVGPPRPVANFVFLVET 

OFLfflOQAGHKLPTSGDPPASASQSARITGMS 

HRTWFLASFLIDSCKNFIVYKIMYTL 


558 


1908 


A 


4225 


3 


1253 


TYRHAEREHPETSSATKVSYDYRHKRPKLLD 

GDQDFSDGRTQKYCKEEDRKYSFQKGPLNRE 

LDCFNTGRGRETQDGQVKEPFKPSKKDSIAC 

TYSNKNDVDLRSSNDKWKEKKKKEGDCRKE 

SNSSSNQLDKSQKLPDVKPSPINLRKKSLTVK 

VDViCKTVDTFRVASSYSTERQMSHDLVAVG 

RKSENFHPVFEHLDSTQNTENKPTGEFAQEirr 

nHQVKANYFPSPGITLHERFSVKMADIHKADV 

NEIPLNSDPEIHRRIDMSLAELQSKQAVIYESE 

QTUKUDPNDLRHDIERRRKERLQNEDEHIFHI 

ASAAERDDQNSSFSKNYTTQRKDIITHKPFEV 

EGNHRNTRVRPFKSNFRGGRCQPNYKSGLVQ 

KSLYIQAKYQRLRFTGPRGFTTHKFRERLMRK 

KKVP 


559 


1909 


A 


4235 


1 


323 


KFSIPFFLRWSFTLV\PRLEGND\aSVHCNLGL 
LGLSHSPASASQVGGITGTQHHTGLIFGFIJET 
EFHHVGQAGLELLTSGDPPALAFQSAGITGVS 
HHAWLQVLNS 


560 


1910 


A 


4246 


2 


1569 


TI^LLmviMKDIVTPVPQEEVKTVmKCXEQ 

AALVNYSRLSEYAKIEGKKREMYELPVFCLA 

SQVMDLTIQNQKDAENVORLITPAKKLEDnR 

LAELVEVLQQNEEHHAEAFAWWSDLMVEH 

AETFLSLFAVDMDAALEVQPPDTWDSFPLFQ 

LL\NDFLRTGLLICGNGK\FHKHLQDLFAPLW 

R/YMWDLDGSSPIAQSIHRGLLSRESWEPVNN 

GSGTSEDLFWKLDALQTFIRDLHWPEEm'GK 

HLEQRUOMASDMIBSCVKRTRUAFEVKLQK 

TSSIQQIFRVPQFNMAPCFNVMQLMAKGSIQP 

mCSMEMGQEFAKMWHQYHSKIDEUEBTV 

KEKflTLLVAKFVTILEGVLAKLSRYDEGTLFS 

SFLSFTVKAASKYVDVPKPGMDVADAYVTF 

VRHSQDVUIDKVNEEMYIERLFDQWYNSSM 

NVlCTWLTDRMDLQUnYQIJCrLDUvfVKKTY 

RDFRLQGVLDSTLNSKTYETIRNRLTVEEATA 

SVSEGG<HX3GISMKDSDEEDEEDD 


561 


1911 


A 


4257 


1300 


654 


selvqfixikdqkkipikradiLkhvigdykdi 
fpdlfkraaerlqyvfgyklvelefksntyil 
SEQ ID NO: of nucleotide sequence 


SEQ ID NO: of peptide sequence 


Method 


SEQ ID NO: in USSN 09/496914 


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence 


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence 


Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














INTLEPVEEDAEMRGDQGTPTTGLLMIVLGLl 

FMKGNTIKETEAWDFLLAL\GVYPTKKHLIFG 

DPKKLITEDFVRQRYLEYRRIPHTDPVDYEFQ 

WGPRTNLBTSKMKVLKFVAKVHNQDPKDW 

PAQYCEALADEENRARPQPSGPAPSS 


562 


1912 


A 


4260 


1 


1498 


MVTWLYRFLPTSNMAAKLRSLLPPDLRLQF 
WLHARLQKCFLSRGCGSYCAGAKASPLPGK 

mamglmcgrrellrllqsgrrvhsvagpsq 

wlgkplttrllfpaapcccrphylflaasgpr 

slstsaisfaevqvqappwaatpsptavpev 

asgetadwqtaaeqsfaelglgsytpvgli 

qnLlefmhvdlglpwwgaiaactvfarcuf 

puvtgqreaarihnhlpeiqkfssrireaicla 

gdhieyykassemalyqkkhgiklykplilpv 

TQAPIFlSFFIALRmylANLPVPSLQTGGLWWF 
QDLTVSDPimPLAVTATMWAVLELGAETG 
VQSSDLQWMRNVIRKlMPLni.PrrMHFPTAV 

fmywlssnlfslvqvscuupavrtvucipqr 
whdudklppregflesfkkgwknaemtrq 

LREREQRMRNQLELAARGPLRQTFTHNPLLQ 
PGKDNPPNIPSS\SSSSSKPKSKYPWHDTLG 


563 


1913 


A 


4265 


623 


116 


MGGLAPTQTLEPT\REYQNTQLSVSYLLPEQN 
THGTRRTLSSGPSNNLPLPLSSSATMPSM.QCK 
HRSPNGGIfRQSPVK/TPPIPMSFQPVPGQWL 
PRGSGNPPHGTSILTAPPAIXPHPPTHPTQQSF 
UQENNNThnnHSHTHTYTETLSFFLYICVNN 
DRMEWGKSVF 






A 

A 




3 


368 


ILKRKLSSLNSEVSTIQNTRMLAFKATAQLFIL 
QCTWCLGLLQVOPAAQVMAYLFTIINSLQOF 
FDFLVYCLLSXQQVQKQYQKWFRETVKSKSES 
ETYTLSSKMGPDSKPSEGDVFPRTSE 


565 


1915 


A 


4288 


83 


406 


rnsrplwcsppasqprqapvsqscccplpssss 
ppsallaptkpralgtlrlyecspelgttmlp 
pawllmlcqaprpqdpdprltqpekslqeap 
gqtgasrtprt 


566 


1916 


A 


4298 


1041 


229 


LNSSQKLACUGVEGGHSLDSSLSVLRSFYVL 

GVRYLTLTFTCSTPWAESSTKFRHHMYTNVS 

GLTSFCTKWEELNRLGMMIDLSYASDTLIRR 

VLEVSQAPVIFSHSAARAVCDNLLNVPDDILQ 

IXKKNGGIVMVTLSMGVLQCNLLANVSTVA 

DHFDHIRAVIGSEnGlGGNYDGTGRFPCJGLVE 

DVSTYPVLIEELLSRSWSEEELQGVLRGNIXR' 

VFRQVEKVREESRAQSPVEAEFPYGQLSTSCH 

FHLGASEWTPRLLIWR 


567 


1917 


A 


4299 


1 


1106 


GATPLGSVGGRTGKMDAATLTYDTLRFAEFE 

DFPETSHPVWILGRKYSIFTEKDEILSDVASRL 

WFTYRKNFPAIGGTGPTSDTGWGCMLRCGQ 

MIFAQALVCRHLGRDWRWTQRKRQPDSYFS 

VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQ 

WYGPNTVAQVLKKLAVFDTWSSLAVHIAMD 

Nl V VMIIKIKIU^LRTS VFCAGATAITADSDRH 

CNGFPAGAEVTNRPSPWRPLVLLIPLRLGLTD 

INEAYVETLKHCFMMPQSLGVIGGKPNSAHY 

nGYVGEELIYLDPHTTQPAVHTDGCFIPDES 

FHCQHPPCRMSIAELDPSIAWRGGHLSTQAF 

QAECCLGMTRKTFGFLRFFFSMLG 


568 


1918 


A 


4300 


2012 


1843 


SRKFLTlTPIVLYFLTSFrrKYDQIHFVLNTVS 
LMSVUPKLPQLHGVRtFGINKY 


569 


1919 


A 


4302 


186 


531 


WTFCLFL/WWVPESARWLLTQGHVKEAHRY 
Amino acid sequence (A=Alanine, C=Cysteine, 
D=Aspartic Acid, E=Glutamic Acid, 
F=Phenylalanine, G=Glycine, H=Histidine, 
I=Isoleucine, K=Lysine, L=Leucine, 
M=Methionine, N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, W=Tryptophan, 
Y=Tyrosine, X=Unknown, *=Stop codon, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 














IXHCARLNGRPVCEDSFSQEVRVNVCVSMHl 
CVWWGVGCVKCLPPRAHHfWQBKPLGPHRT 
VTESKLEAEGKTKEKAREKERKKKS 


570 


1920 


A 


4308 


3 


869 



RSGQOKVYGUGRRRFQQMDVLEGLNLLinS 

GKRNKLRVYYLSWLRNKILHNDPEVHKKQG 

WTTVGDMEGCGHYRWKYERIKFLVIALKSS 

VBVYAWAPKPYHKFMAFKSFADLPHRPLLV 

DLTVEEGQRLKVIYGSSAGFHAVDVDSGNSY 

DrVTPVfflQSQITPHAIIFLPNTDGMEMLLCYE 

DEGVYVNTYGRnKDWLQWGEMPTSVAYIC 

SNQIMGWGEKAIEIRSVETGHLDGVFMHKRA 

QRLKFLCERNDKVFFASVRSGGSSQVYFM7L 

NRNCIMNW 


571 


1921 


A 


4309 


9 


524 


ASREMDVTKVCGEMRYQLNKTNMEKDEAE 
KEHREFRAKTNRDLEIKDQEIEKLRIEU)ESK 
QHLEQEQQKAALAREECLRLTELLGESEHQL 
HLTRQEKDSIQQSFSKEAKAQALQAQQREQE 
LTQKIQQMEAQHDKTENEQYLLLTSQNTFLT 
KLKEECCTLAKKLEQISQ 


572 


1922 


A 


4318 


1 


1119 


GATPLGSVGGRTQKMDAATLTYDTLRFAEFE 

DFPETSH'VWILGRKYSIFTEKDEILSDVASRL 

WFTYRKNFPAIGGTGPTSDTGWGCmRCGQ 

XOFAQALVCRHLGRDWRWTQRKRQPDSYFS 

VLNAFIDRKDSYYSIHQIAQMGVGEGKSIGQ 

WYGPNTVAQVLKKLAVFDTWSSLAVHIAMD 

NTWMEEIRKLCRTSVPCAGATAFPADSDRH 

CNGFPAGAEVTNRPSPWRPLVIXIPLRLGLXT 

DINEAYV\ETL\KHCFHGWPQFPG/WHREGK 

PNSAHYFIGYVGEEUYIXlPHTrQPAVEPTDG 

CapIPDESFHCQHPPCRMSIAELDPSIAWRGGH 

LSTQAFGAECCIXaiTRKTFGFLRFFFSMLG 


573 


1923 


A 


4333 


363 


1066 


GGVPVGLASKPFQILYGHTNEVLSVGISTBLD 
MAVSGSRDGTVmmQKGQYMRTLRPPCESS 
Ua-TIPNLAISWEGHIVVYSSTEEKTTLK\ERM 
HYlCFSlNGKYLGSQILKEQVSDICnGEHIVTG 
SIQGFLSIRDLHSLNLSINPLAMRLPIHCVCVT 
KEYSfflLVGLEDGKUWGVGKPAEVKPSISN 
FISHAVGDYFGSPSFQUEKSPLGINKLKAKFD 
FSKGSK 


574 


1924 


A 


4346 


359 


1234 


MDTLEEVTWANGSTALPPPLAPNISVPHRCLL 

LLYEDIGTSRVRYWDIXUJPNVLFLIFLLWK 

LPSARAKIRrrSSPIFriTYILVFVVALVGlARA 

WSNm^STSNAATVADKILWEITRFFLLAlEL 

SVnLGLAFGHLESKSSKRVLAnTVLSLAYSV 

TQGTLEILYPDAHLSAEDFNIYGHGGRQFWL 

VSSCTFFLVYSLWILPKTPLKERISLPSRRSFY 

VYAGIL^LI-NLLQGLGSVIXCFDIIEGLCCVD 

ATTFLYFSFFAPUYVAFLRGFFGSEPKILF 


575 


1925 


A 


4360 


2038 


1512 


GCWWRHPWLASQRDCLDCRIQLAEKFVKAV 

SKPSRPDMNPIRVKEVYRLEEMEKIFVRLEM 

KIDCGSSGTPKLSYTGRDDRHFVPMGLYIVRT 

ADSIAPFHICYYGRLFWEWGDGIRVHDSQKP 
QDQDKLSKEDVLSnQMHRA 


576 


1926 


A 


4365 


69 


500 


QVEGRQGREVKRTAWRISPVWRPARCRRRST 

PQP/PE/PGAQQQERHRQGEAPMQALDPRAEP 

GPQAQSHAACQPEPEPPRVLLDPTAARGGVQ 

(HIP/GLSRHPOLAPHPQTHTPWPQSGRLPCAS 

EPLPLGGIRnKJLEPKGRDLM 
D=Aspartic Acid, E=Glutamic Acid, 
/=possible nucleotide deletion, \=possible 
nucleotide insertion 


577 


1927 


A 


4366 


785 


502 


SAPPKKKNGVLFLSPRLKSSGAIWVHSTPTLW 
ASSNSRASTPKVAGITGARPHARIIFVFLIEMG 
FHNVGQAGL/DTLTLVICPPQPPKLLGLQM 


578 


1928 


A 


4367 


1 


221 


FKKFLKKSRCVTQAGVQGVPISLHPPFPGFKRF 

SRLSLLSSWDYRHP/HAANFCEFSRDGWSPYW 

SGWSRTPDLR 


579 


1929 


A 


4383 


1 


224 


FETESHSVTQAGMQWHNLGSLQPMP/PGLKR 
FSCLRLQSSWDHRHAPPHLAHFCIFSRIXJVSP 
CWPGWSSTPDLK 


580 


1930 


A 


4397 


410 


94 


SRLKPYSTNVTAKKLPATNIPNLDCFrAKLYQ 
WFKKGIMHILHELFQNKEEGAFPNS/FYEASFT 
LRPKSDRDIAKEESYSnSLLSTDTKILMSKYK 
QLKSSDL 


581 


1931 


A 


4414 


670 


3 


VLVHRQCGGILRLRRKEAVSVLDSADIEVTDS 

RIJHATIVDHRPQHRWLETCNAPPQLIQGKA 

RSAPKPSQASGHFSVELVRGYAGFGLTLGGG 

RDVAGDTPLAVRGLLKDGP\AQRCXJRLEVGD 

LVLHINGESTQGLTVHAQAVERIRAGGPQLHL 

VIRRPLETHPOKPRGVQEPRKOWPSWPDRSP 

DPOGPEVTGSRSSSTSLVQHPPSRTTLKKTRG 

SPE 


582 


1932 


A 


4424 


194 


449 


VLYIRKKKltLEKLRHQLMPMYNFDPTEEQDE 

LEQELLEHGRDAASVQAATSVQAMQGKTTL 

PS\QGPLQRPSRLVFI\DVANAIHV 


583 


1933


A 


4435 


1 


166 


APGPPVPPPGSPPEQMPGPCPASMPP/DPPPGS 
PPEQMPGPCPVSAPP/GPPPGSPPEQMPGPCPV 
SAPPALLQDTSV 


584 


1934 


A 


4439 


1 


628 


SATPQQPSAPQHQGTLNQPPVPGMDESMSYQ 

APPQQLPSAQPPQPSNPPHGAHTLNSGPQPOT 

APATQHSQAGPATOQAYGPHTYTEPAKPKK 

GQQLWNRMKPAPGT\EVSSSTSRSDPLLLPPR 

ALAPTQRASTWLAPSPT/SEKVQNHSGSSAR 

GNLSGKPDDWP/LGHERVCGALLHRL*VGGG 

QGPHGKAAQGGAAGAAAGRLGLYH 


585 


1935 


A 


4463 


10 


144 


HKPVTNSRDTQEVPLEKAKQVLKIIATFKHTT 
SIFDDFAHYEKRQ 


586 


1936 


A 


4464 


1309 


103 


LNAESYVSFTTKLDIPTAAKYEYGVPLQTSDS 

FLRFPSSLTSSLCTDNNPAAFLVNQAVKCTOK 

INLEQCEEIEALSMAFYSSPEILRVPDSRKKVPI 

TVQSrVIQSLNKTLTRREDTDVLQPTLVNAGH 

FSLCVNWLEVKYSLTYTDAGEVTKADLSFV 

LGTVSSVWPLQQKFEIHFLQENTQPVPLSGN 

PGYWGLPLAAGFQPHKGSGnQTTNRYGQLT 

ILHSTTEQDtaLAI^VRTPVLFGYTMQSGCJC 

LRLTGALPCQLVAQKVKSLLWGQGFPDYVA 

PFGNSQGP/ADMLDWVPIHFrTQSFNRKDSCQ 

LPGALVBEVKWTKYGSLLNPQAKIVNVTANU 

SSSFPEANSGNERTILISTAVTFVDVSAPAEAG 

FRAPPAINARLPFNFFFPFV 


587 


1937 


A 


4471 


614 


387 


LLGRASAC/LQLQSSW/D/HRPMLPYLANFVF 
CKDR/SFTWLPRLVLNSWLQVUXPWPPTGCD 
NKJUirrUrAl KjuUlaUM 


588


1938 


A 


4480 


1720 


1458 


HDLGSLQPPPPGFKKFSCXSLPSSWDYRLMPP 
CPANFCnn/DFLVETGFHHVGQASHELLTSGD 
PPTSASQSAGITGMSYHTWFGES 


589 


1939 


A 


4487 


922 


332 


APVTTSPRVGQPW/RTALALRSLYRARPSLRC 
PPVELPWAPRRGHRLSPADDELYQRTRISLLQ 
REAAQAMY1DSYNSRGFMINGNRVU3PCALL 
PHSWQWNVGSHQDITEDSFSLFWLLEPRIEI 
VWGTGDRTERLQSQVLQAMRQR<HAVEVQ
DTPNACATFNFLCHEGRVTGAALIPPPGGTSL 
TSLGQAAQ 


590 


1940 


A 


4492 


1 


472 


Fl'FFKrESRSVAQAGVQWRDLGSLQAPPPGFT 

PFSCLSLPSSWDYRRPPLRPANFFVFLVETGFP 

RPSRDGLDLLT/S/GDPPTSASQSAGITGVSHR 

ARPKRIGEPRRKCGNAWWPSTSLGDHRVTS 

VPHQGGLPGPIRVAPSSAGQREASQGPPGR 


591 


1941 


A 


4495 


1444 


1116 


rAARFTLAKTWNQLKRP\TMIDSIKKTR\YIYT 
MEYYADTERNEIMSFVAGTWVELEAIILSKLM 
LKDNWVEDTIPQGAVPCTATAEGMKRLLFAL 
EPWDSSCFPHPSSGV 


592 


1942 


A 


4496 


2 


919 


RTRPLFSQRPTRPVCTMSDERRLPGSAVGWL 

VCGGLSLLANAWGILSVGAKQKKWKPLEFL 

LCTLAATHMLNVAVPIATYSWQLRRQRPDF 

EWNEGLCKVFVSTFYTLTLATCFSVTSLSYHR 

MWMVCWPVNYRLSNAKKQAGHTVMGIWM 

GSFILSALPAVGWHDTSERFYTHGCRFIVAH 

GLGFGVCFLLLVGGSVAMGVICTAIALFQTL 

AVQVGRQADHRAFTVPnWEDAQGKRRSSI 

DGSEPAKTSLQTTGLVrnVFIYDCLMGFPVL 

GPFSLADTHLSDLPYTWGDRDSGGACVM 


593 


1943 


A 


4506 


2 


193 


KFKliAESCSVPQAGVQRPDLGWLHAPPP\GSC 
HFPASASQVAGTrHARHHTQUFlAFLVENGL 
C 


594 


1944 


A 


4507 


1327 


647 


KMAGGVRI>LRGLRALCRVLLFLSQFCILSGG 

ESTEIPPYVMKCPSNGIX:SRIJPADCIDCTTNFS 

CTYGKPVTFDCAVKPSVTCVDQDFKSQKNFn 

NMTCRFCWQLPETDYECTNSTSCMTVSCPRQ 

RYPANCTVRVDHVHCLGNRTFPKMLYCNWT 

GGYKWVYGLWLLRHHPRWOU3ADRF\YLOP 

VAGTASGKLFSFGGLGIWTLIDVLLIGVGYVG 

PADGSLYl 


595 


1945 


A 


4512 


533 


264 


FFFKMHSYSVARLECSGAISAPCNLHIXGSNN 
SPASASRV/AGNIGARMJTQQIFVLLVQMRVH 
YVGQDGLDLL/NLMIHPPRSPKVLGLQA 


596 


1946 


A 


4513 


3 


1674 


HASDHLYPNFLVNELILKQKQRFEEKRFKLD 

HSVSSTNGHRWQIFQDWLGTDQDNLDLANV 

NLMLELLVQKKKQLEAESHAAQLQILMEFLK 

VARRNKREQLEQIQKELSVLEEDIKRVEEMS 

GLYSPVSEDSTVPQFEAPSPSHSSnDSTEYSQP 

POFSGSSQTKKQPWYNSTLASRRKRLTAHFE 

DLEQCYFSTRMSRISDDSRTASQLDHFQEOLS 

KF\TRYNSVRPL\A1XSYASDLYNGSQYKSLV 

FEFDRDCDYPAIAGVTKKIKVYEYDTVIQDA 

VDIHYPENEMTCNSKISCISWSSYHKNLLASS 

DYEGTVILWDGFTQQRSKVYQEHHCRCWSV 

DFNLMDPKLLASGSDDAKVKLWSTNLDNSV 

ASIEAKANVCCVKFSPSSRYHLAFGCADHCV 

HYYDLRNnCQPIMVFKGHRKAVSYAKFVSG 

EErVSASTDSQLKLWNVGKP\YCLRSFKGHIN 

EKNFWGLASNGDYIACGSENNSLYLYYKGLS 

K1LLTFKFDTVKSVLDKDRKEDDTNEFVSAV 

CWRAIJPDGESNVLIAANS\QGTI\KVLELV 


597 


1947 


A 


4518 


536 


824 


RSLALSPGLECSGNaSAHCNLHLLGSSDPPTS 
ASQVAEirSVRHHTWLIFCIVLGQMGFHHVGE 
QAGLEU.TSWDPAELPSQSAGnGMSPHAWM> 


598 


1948 


A 


4524 


1 


384 


FDTEFVNIGGDFDAAAGVFRXCRLPGAYFFSF 
TIXiKLPRKTLSVKLMKNRDEVQANlIYDDGSS 
RRREMQSQSVMLALRRGDAVWLLSHDHDG
YGAYSNHOKYrrFSGFLVYPDLAPAAPPGLG 
ASELL 


599 


1949 


A 


4526 


366 


776 


MQQPAPYAEGPIQGGDAGELCKCDFLVFTSP 
NPEAVCEAGTPAMFQTAWRQMESCSyAQAG 
VQWRDPGSLHPPPLGFKRFSCLSU'SSWDYK 
HAPPHPANFCIFSRDQVSPCWPGWSRSLDLVI 
PPPWLPKVU3LQA 


600 


1950 


A 


4529 


776 


334 


FFFETESCYVAQAGVQWCDLCSLQAPPPG\SS 
DPPASASRVAGTTGARHHTQLIFVFLVETGFH 
\MLARDGLKLLTSSDPPASASQSSWDYRREPP 
RLANFFVFLVETGSRYVAQAGVQWLFTGAIP 
IXISTGVLTCSVSDLGRFTPP 


601 


1951 


A 


4533 


1460 


403 


HEVQESmFLESEFSROISDNYTLALTTYALSS 

VGSPKAKE/VLNMLTWRAEQEGGMQFWVSSE 

SKLSDSWQPRSLDIEVAAYALLSHFLQFQTSE 

GIPIMRWLSRQRNSLGGFASTQDTTVALKALS 

EFAAIAINTERTOIQVTVTGPSSPSPVKFLIDT 

HNRLLLQTAELADGTANGSV/SISANGFGFAI 

CQLNWYNVKASGSSRRRRSIQNQEAFDLDV 

AVKENKDDLNHVDLNVCTSFSGPGRSGMAL

MEVNIXSGFMVPSEAISLSETVKKVEYDHGK 

LNLYLDSVNETQFCVNIPAVRNFKVSNTQDA 

SVSrVDYYEPRRQAVRSYNSEVKLSSCDLCSD 

VQRLPSL 


602 


1952 


A 


4540 


1963 


295 


MRAPORPALRPLPLPPLLLLLLSSPWGRAVPC 

VSGGIPKPANITFLSINMKNVLQWTPPEGLQG 

VKVTYTVQYFIYGQKKWLNKSECRMNRTYC 

DLSAETSDYEHQYYAKYKAIWGTKCSKWAE 

SGRFYPFLETQIGPPEVALTTDEKSISWLTAP 

EKWKKNPEDLPVSMQQIYSNLKYNVSVLNT 

KSNRTWSQCVTNHTLVLTWNLHWLYCVHV 

ESFVPGPPRRAQPSEKQCARTLKDQSSEFKAK 

IIFWYVLPISrrVFLFSVMGYSIYRYIHVGMCBK 

HF\ANLILryGVNEFDKIUTWA\EKIV\INFIVTL 

NIS\DDSK1SHQDMSLLGKSSDVSSLNDPQPSG 

NLRPPQEEEEVKHLGYASHLMEIFCDSEENT\ 

EGTSFTQQESLSRTIPPDKTVIEYEYDVRTIDI 

CAGPEEQELSLQEEVSTQGTLLESQAALAVL 

GPQnLQYSYTPQLQDLDPLAQEHTDSEEGPffi 

EPSTTLVDWDPQTGRLOPSLSSFDQDSEGCE 

PSEGDGLGEEGLLSRJLYEEPAPDRPPGENETY 

LMQFMEEWCSLYVQMEN 


603 


1953


A 


4543 


3 


600 


YSAVEFVEQASGISDWWNPALRKRMLSDSGL 

CHkllAPYYEDSDLKDLSHSRVLQSPVSSEDHAI 

LQAVIAODLMKLIESYKNGGSLLIQGPDHCSL 

LHYAAETGNGEIVKYILDHGPSELLDMADSE 

TGETALHKAACQKNRAVCQLLVDAG/VSLRK\ 

TDSKGKTPQERAQQA\GDPDLAA/YTIESRQN 

YKVIGHEDLETAV 


604 


1954 


A 


4548 


3 


938 


QDNKVQNGSLHQKDTVHDNDFEPYLTGQAN 

QSNSYPSMSDPYLSSYYPPSIGFPYSLNEAPW 

STAGDPPIPYLTTYGQLSNGDHHFMHDAVFO 

QPGGLONNIYQHRFNFFPENPAFSAWGTSOS 

QGQQTQSSAYGSSYTYPPSSLGGTWDGQPG 

FHSDTLSKAPCaiNSLEQGMVGLiaGDVSSSA 

VKTVGSWSSVALTGVLSGNGGTNVNMPVS 

KFTSWAAIASKPAKPQPKMKTKSGPVMGGG 

LPPPPKHNMDIGTWDNKGPVPKAPVPQQAP 
SPQAAPQPQQVAQPLPAQPPALAQPQYQSPQ
QPPQ 


605 


1955 


A 


4553 


2 


2304 


ILLQEKRNCLLMQLEEATRLTSYIXJSQLKSLC 

ASTLTVSSGSSRGSLASSRGSLASSRGSLSSVS 

FTDIYOLPQYHKPDAEOSQLLRFDUPFDSLGR 

DAPFSEITGPSGFHKQRRSLDTPQSLASLSSRS 

SLSSLSPPSSPLDTPFLPASRDSPLAQLADSC3 

Ca>GLGALDRLRAHASAMGDEDLPGMAALQP 

HGVPGDGEGPHERGPPPASAPVGGTVTLRED 

SAKRLERRARRISACLSDYSLASDSGVFEPLT 

KRNH^AEEPAYODTASNGDPQIHVQLLRDSG 

SECLLVHVLQLKNPAGLAVKEDCKVHIRVYL 

PPUJSGTPNTYCSKALEFQVPLVFNEVFRIPV 

HSSALTLKSLQLYVCSVTPQLQEELLGIAQIN 

LADyDSLSEMQLRWHSVQVFTS\LNHQGRGR 

LGVQERAPPGTLHTPSPSPA/STDAVTVLLAR 

TTAQLQAVERELAEERAKLEYTEEEVLEMER 

KEEQAEAISERSWQADSVDSGCSNCTQTSPPY 

PEP(XMGiDSILGHPFAAQAGPySPEKFQPSPL 

KVDKETNTEDLFLEEAASLVKERPSRRARGSP 

FVRSGnVRSQTFSPGARSQYVCRLYRSDSDS 

STLPRKSPFVRNTLERRTLRYKQSCRSSLAEL 

MARTSLDLELDLQASRTRQRQLNEELCALRE 

LRQRLEDAQLRGQTDLPPWVLRDERLRGLLR 

EAERQTRQTKLDYRHEQAAEKMLKKASKEl 

YQLRGQSHKEPIQVQTFREKIAFFTRPRDgiPPL 

PADDV 


606 


1956 


A 


4555 


3429 


776 


PGSGPGPAPFLAPVAAPVGGISFHLQIGLSREP 

VLLLQDSSGDYSLAHVREMACSrVDQKFPEC 

GFYGKfroKILLFRHDPTSmiLQLVKAASDIQ 

EGDUEVYLSASATFEDFQIRPHALFVHSYRA 

PAFC3DHCGEMLWGLV\RQGLKCEGCGLNYH 

KRCAFKIPNNCSGVRRRRLSNVSLTGVSTIRT 

SSAaSTSAPDEPLLQKSPSESFIGREKRSNSQ 

SYIGRPlHUJKIUvISKVKVPHTFVIHSYTRPTV 

(XJYOaaXKGLFRQGLQCKDCRFNCaiKRCA 

PKVPNNCLGEVnNGDLLSPGAESDWMEEG 

SDDNDSERNSGLMDDMEEAMVQDAEMAMA 

ECQNDSGEMQDPDPDHEDANRTISPSTSNNIP 

LMRWQSVKHTKRKSSTVMKEGWMVHYTS 

KDTLRKRHYWRUJSKCaTLFQNDTOSRYYKE 

IPLSEILSLEPVKTSALIPNGANPHCFErn'ANV 

VYYVGENWNPSSPSPNNSVLTSGVGADVAR 

MWEIAIQHALMPVIPKGSSVGTGTNLHRDISV 

SISVSNCQIQENVDISTVYQIFPDEVLGSGQFGI 

VYGGKHRKTGRDVAIKIIDKLRFPTKQESQLR 

NEVAnXJNUfflPOVVNLECMFETPERVFVVM 

EKLHGDMLEMILSSEKGRLPEHITKFLmQILV 

AUUIUIFKNIVHCDLKPENVLLASADPFPQV 

KLCDFGFARnOEKSFRRSWGTPAYLAPEVL 

RNKGYNRSLDMWSVGVHYVSLSGTFPFNED 

EDIHDQIQNAAFMYPPNPWKEISHEAIDLINN 

LJlA^Vis.AliuUCXdVJL/Kll^anrWLQl/1 0^1 WitDL 

RELECKIGERYTrHESDDLRWEKYAGEQGLQ 

YPTHLINPSASHSDTPEXEETEMKALGERVSIL 


607 


1957 


A 


4563 


1 


4499 


SRPWWLRASERPSAPSAMAKRSRGPGRRCLL 

ALVLFCAWGTXAWAQKPGAGCPSRCLCFRT 

TVRCMHLLLEAVPAVAPQTSILDLRFNRIREI 

QPQAFRRLKNLNmXNNNQKRIPSGAFEDL 

ENLKYLYLYKNEIQSIDRQAFKGLASLEQLYL 
Amino acid sequence (A=Alanine, C=Cysteine,
D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine,
I=Isoleucine, K=Lysine, L=Leucine,
M=Methionine, N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine, W=Tryptophan,
Y=Tyrosine, X=Unknown, *=Stop codon,
/=possible nucleotide deletion, \=possible
nucleotide insertion)

HFNQlETIJDPDSFQHLPKlEIUJUlNNRrrHL 

VPGTET-IHLESMKRLRLDSNTLHCDCEILWLA 

DLLKTYAESGNAQAAAICEYPRRIQGRSVATI 

TPEELNCERPRrrSEPQDADVTSGNTWFTCR 

AEGNI^EnWLRNNNELSMKTDSRLNLLDD 

GTLMIQNTQETDQGIYQCMAKNVAQEVKTQ 

EVTLRYFGSPARPTFVIQPQNTEVLVGESVTL 

ECSATGHPPPRISWTRGDRTPLPVDPRYNITPS 

GGLYIQNWQGDSGEYACSATNNIDSVHATA 

FIIVQALPQFTVTPQDRVVIEGQTVDFQCEAK 

GNPPPVIAWTKGGSQLSVDRRHLVLSSGTLRl 

SGVALHDQGQYECQAVNnGSQKWAHLTVQ 

PRVTPVFASIPSDTTVEVGANVQLPCSSQGEP 

EPAITWNKDGVQVTESGKFHlSPEGFLnNDV 

GPADAGRYECVARNTIGSASVSMVLSVNVPD 

VSRNGDPFVATSIVEAIATVDRAINSTRTHLF 

DSRPRSPNDLLALFRYPRDPYTVEQARAGEIF 

ERTLQLIQEHVQHGLMVDLNGTSYHYNDLVS 

PQYLNUANLSGCTAHRRVNNCSDMCEHQKY 

RTHDGTCNNLQHPMWGASLTAFERLLKSVY 

ENGFNTPRGINPHRLYNGHALPMPRLVSTTLI 

GTETVTPDEQFTHMLMQWGQFLDHDLDSTV 

VALSQARFSDGQHCSNVCSNDPPCFSVMIPPN 

DSRARSGARCMFFVRSSPVCGSGMTSLLMNS 

VYPREQINQLTSYIDASNVYGSTEHEARSIRD 

LASHRGLLRQGIVQRSGKPLLPFATGPPTECM 

RDENESPIPCFLAGDHRANEQLGLTSMHTLW 

FREHNRIATELLKLNPHWDGDnyyETRKIVG 

AEIQHnYQHWLPKILGEVGMRTLGEYHGYD 

PG1NAGIFNAFAT\AAERFGHTLVNPLLLPGIX> 

ENFQPIAQDHLPLHKAFFSPFRIVNEGGIDPLL 

RGLFGVAGKMRYPSQLLNTELTERLFSMAHT 

VALDLAAINIQRGRDHGIPPYHDYRVYCNLS 

AAHTFEDLKNEDCNPEIR^LKRLYGSTLNID 

LFPALWEDLVPGSRLGPTLMCLLSTQFKRLR 

DGDRLWYENPGVFSPAQLTQIKQTSLARILCD 

NADNTIRVQSDVFRVAEFPHGYGSCDEIPRVD 

LRVWQDCCEDCRTRGQFNAFSYHFRGRRSLE 

FSYQEDKPTKKTRPRKIPSVGRQGEHLSNSTS 

A\FSTRSDASGVTNDFQRVCSWEMQKTni)LR 

TQDCKLESRVLSTIECVDAGGESHANNTKWK 

KDACnCECKDGQVTCFVEACPPATCAVPVNI 

PGACCPVCLQKRAEEKP 


608 


1958


A 


4566 


354 


1135 


FSFLaGVSGRLGLDSEEDYYTPQKVDVPKAL 

nVAVQCGCDGTFLLTQSGKVLACGLNEFNKL 

GLNQCMSGIINHEAYHBVPYTTSFTLAKQLSF 

YKXRTIAPGKTHTAAIDERaRLLTFGCmCGQ 

LGVGNYKKRLGINU.GGPLGGKQVIRVSCXJD 

EFIlAATDDNHIFAWGNGGNGRLAMlPrERP 

HGSDICTSWPRPIFGSLHHVPDLSCRGWHTILI 

VEKVLNSKmSNSSGLSIGTVFQSSSPGGGGE 

GGPDAW 


609 


1959 


A 


4567 


1




rrrrHl iloKa VAQAGVQWRDLGSLQAPPPGFT 

PFSO^LPSSWDYRRPPLRPANFFVFLVETGF 

HRFSRDGLDLLT/S/GDPPASASQSAGITGVSH 

IWRPRINIJWVIYSFAVTYCLNYISLAMSSTL 

KLSFHVLSGS 


610 


1960 


A 


4570 


697 


467 


ECROVlSAH\CXnLaj>SSSDSASAF\RVARTT 
GTCDYAQUFAFLVEMQFHHVGQDGLHLL/N 
LVIRPPRPPKVLGLQA 


611 


1961 


A 


4571 


25 


1396 


ADPHITVIRFFPAASATKRVLPPVLRVSSPRT 
WNPNVPESPRIPAPRLPKRMSGAPTAGAALM 
LCAATAVLLSAQGGPVQSKSPRFASWDEMN 
VLAHGLLQLOQGNCANnOAHPQSAERAGAVR 
LSACGSACQGTnEGSTDLPLAPESRVDPEVLHS 
LQTQLKAQNSRIQQLFHK^AQQQRHLEKQHL 
RIQHIXJSQFGUJ5HKHIJDHEVAKPARRKRLP 
mAQPVDPAHNVSRLHRLPRDCQELFQVGER 
QSGLFEIQPQGSPPFLVNCKMTSDQO'WTVIQR 
RHDGSVDFNRPWEAYKAGFGDPHOEFWLGL 
EKVHSrrcDRNSRLAVQLfiDWDGNAHXQFS 
VHLGGEDTAYSLQLTAPVAGQLGATTVPPSG 
LSVPFSTWDQDHDLRRDKNCAKSLSGGWWF
GTCSHSNLNGQYFRSIPQQRQKLKKGIFWKT 
WRGRYYPLQATTMLIQPMAAEAAS 


612 


1962 


A 


4575


162 


3 


FFFb-lliSRSVAQAGVQWRDLSSLQPPPPG\SR 
GSPASASPVAOrrGTRHHRTRG 


613 


1963 


A 


4584 


687 


321 


PLAQRRPFLWVTVKTNGHTWGSSTYPHFWGS 
SNS/PASASQVAGIPNARHQARIIFVFLVEPRF 
HHVGRAGLOFLyNLAICXPQHPKVLOLQACN 
LNEKPHPAHKYISMIQFNVHFMCMSVHIYI 


614 


1964 


A 


4589 


727 


299 


PGSAQSAQRGRGRRRARAGSATQITMYSFMG 

GGLFCAWVGTILLWAMATDHWMQYRLSGS 

FAHQGLWRYCLGNKCYLQTDSIAYWNATRA 

FMILSALCAISGIIMGIMAF/GWVAVLMTFFA 

GIFYMCAYRVHECRRLSTHR 


615


1965 


A 


4590 


2 


414 


TILPEKIQAWAQKQCPQSGEEAVALWHLEK 

ETGRLRQQVSSPVHREKHSPLGAAWEVADFQ 

PEQVETQPRAVSREEPGSLHSGHQEQLNRKR 

ERRPLPKNARPSPWVPALADEWNIUIQEVTT 

TRLPAGSQEPVKD 


616 


1966 


A 


4592 


773 


488 


DFALVAQAGVQWHNLGSPQPLPPGFKRFSCL 
SLPSSWEYRCVPP/RLANFVFLVEMGFLHVGQ 
AGLELPTSGDPPALASQSACaTGVTTVPSGPG 


617 


1967 


B 


4595 


84 


478 


XRHGLREPLLE3UICAAASSFQHSSSLGRELPY 

DPVDTEGFGEGGDMQERFLFPEYHJDPEPQPT 

REKQLQELQQQQEEEERQRQQRREERKQQNL 

RARSREHPWGHPDPALPPSGVNCSGCXJAEL 

HCQDAR» 


618 


1968 


A 


4596 


2945 


1188 


ARSRNSARGVYGMCVOJTLFLCFLEDLERNDG 

SAERPYFMCSTLKKPLARRCFPAIHAYKGVL 

MVQ^ETTYEDGHGSRKNITDLVEOAKKANO 

VLEARQLAMRIFHDYTVSWYWniGLVIAMA 

MSLLSm ,1 .HI ,1 ■ AGIMGWVMIIMEIVSELGYRIF 

HCYMEYSRLRGEAGSDVSLVDLGFQTDFRV 

YUILRQTWLAFNOILSILEVIfflJ-LIELRKRILI 

AIALDCEASRAVOYVMCSLLYPLVTFFLLCUn 

AYWASTAVFLSTSNEAVYKIFDDSPCPFTAKt 

CNPETFPSSNESRQCPNARCXJFAFYGGESGYH 

RALLGLQIFNAFMFFWLANFVL/VLGQVTLAG 

AFASYYWALRKPDDLPAFPLFSAFGRALRYH 

KFAKCUvITCLKCX^FWCLEKFIKFLNRNAYIM 

lATyGTNFCTSARNAFFLLMRNIIRVAVLDKV 

TDFli^GKIirVGSVGILAFFFFTHRIRIVQDT 

APPLNYYWVPILTVIVGSYLIAHGFFSVYGMC 

VDTUa.ara^LERNDGSAHRPYFMSSTLKKL 

LNKTNKKAAES 


619 


1969 


A 


4601 


2 


357 


RTSVEPYILOEF/RKLSNNTKWKTBYKATEY 

GLAYGHFSYEFSNHRDVWDLQGWVTGNGK 

GLIYLTDPQIHSVIX)KVFTTNFGKRGIFYFFN 

NQHVECNEICHRLSLTRPSMEKPCKS 


620 


1970 


A 


4606 


1 


2415 


MERLWGLFQRAQQLSPRSSQTVYQRVEGPR 

KGHLEEEEEDGEEGAETLAHFCPMELRGPEP 

LGSRPRQPNUPWAAAGRRAAPYLVLTALUF 

TGAFLLOYVAFRGSCQACGDSVLWSEDVN 

YEPDLDFHQGRLYWSDLQAMFLQFLGEGRL 

EDTTRQTSLRERVAGSAGMAALTQDIRAALS 

RQKUJHVWTDTHYVGLQFPDPAHPNTLHWV 

DEAGKVGEQLPLEDPDVYCPYSAIGNVTGEL 

VYAHYCHPEDLQDLRARGVDPVGRLLLVRV 

GVISFAQKVTNAQDFGAQGVLIYPEPADFSQ 

DPPKPSLSSQQAVYGHVHLGTGDTPYTPGFPSF 

NQTQFPPVASSGLPSIPAQPISADIASRLLRKL 

KGPVAPQEWQGSLLGSPYHLGPGPRLRLWN 

NHRTSTP1NN1FGCIEGRSEPDHYVV7GAQRDA 

WGPGAAKSAVGTAILLELVRTFSSMVSNGFR 

PRRSLLFISWDGGDFGSVGSTEWLEGYLSVL 

HLKAWYVSLDNAVLGDDKFHAKTSPLLTSL 

IESVLKQVDSPNHSGQTLYEQVVFrN\PSWD\ 

AEVlRPLPlvM)SSAY\SFrAFVGVPAVEFSFME\ 

DIXMYPFLHTKEDTYENLHKVLQGRLPAVA 

QAVAQLAGQLLIRLSHDRLLPLDFGRYGDW 

LRHIGNLNEFSGDLKARGLTLQWVYSARGDY 

IRAAEKLRQEIYSSEERDERLTRMYNVRIMRV 

H^LSQYVSPADSPFRHIFMGRGDHTLQALL 

DHLRU,RSNSSGTPGATSSTGFQ\ESRFRRQL\ 

ALLNTWDACKQAANALSGDVWNIDNNF 


621 


1971 


A 


4610 


793 


334 


ISRVDDFVGSGIANVIIAVAIFSIPAFARLVRG\ 

NIlVlXQQriFIESARSIGASDMTVLLKHILPGT 

GSSIWTFTMRlGTSnSAASLSFLGLGAQPPTP 

EWGAMLNEARADMVIAPHVAVFPALAIFLTV 

LAFNLLGDGLRDALDPKKG 


622 


1972 


A 


4614 


2 


820 


LVYVMIAIFCIASAMSLYNCXAAUHKIPYGQ 

CriACRGKNMEVRUFLSGLCIAVAWWAVP 

RNEDRWAWILQDILGIAFCLNLIKTLKLPNFK 

SCVnXGLLLLYDVFFVFlimTKNGESIMVH- 

AAGPFGNNEKNDGNLYEATOQPSAPHEKLPV 

VIRVPKLIYFSVMSVCLMPVSILGFGDirVPGL 

LIAYCRRFDVQTGSSYnYVSVXTVAYAIGMIL 

TFVVLG\L\OCKGQPALLYLVPCTLITA/0QFV 

AWETVREMKKFWERVTS 


623 


1973 


A 


4619 


17 


691 


TLVSWHFVRRADLTREDLAPSSVDSGQAGF 

OOCCESQLPNTMPSAFSVSSFPVSIPAVLTQT 

DWTEPWLMGLATFHALCVLLTCLSSRSYRLQ 

IGHFLCLVILVYCAEYINEAAAMNWRLFSKY 

QYFDSRGMFISrVFSAPLLVNAKinVVMWVW 

KTIJmm)LKNAQERRKEKKRRRKEt>*GAA 

AAWSLRPSRPPSAAPSAAVCVAWASFQLTHG 

LKNRCFI 


624 


1974 


A 


4622 


164 


668 


SLEENHFYSYPEEVDDDLICfflCLQALLDPLD 

TPCGHTYCILCLTNFLVEKDFCPMDRKPLVL 

QHCKKSSILVNKLLNKLLVTCPEREHCrQVL 

QRCDLEHHFQTSQAWOTHL'SQLLGRLRQED 

CXSPGVHHCSEV 


625 


1975 


A 


4625 


474 


473 


CFLSPSPLLPPLLLSSSSSPSEPLPPPPTLLPSTCT 
PPL1JPSS*LSP 


626 


1976 


A 


4629 


249 


3 


KLKGNECFCYHC3>IVCJFLMIKK*GLFLC*IYFI 
LFFET*SHSFTRLECSGTISAHCSLQLQGSSNSP 
ASASQVAGIAGTHH 


627 


1977 


A 


4635 


1 


301 


tWtriKPPFAPQAGOQGPSRGSLNPLPTGLK 
QFSGLTLSRSGNNGPRPPPRVNFGILRGNGVP 
PGGAG*PRPPDLRGPPGLAPPQGGNNGGDPP 
ARAYL 


628 


1978 


A 


4648 


1357 


782 


KLFSSQRIJGPmQAINPSFLLLSFFPS*LLAMR 

TVONNAFILVFLVYRIVLLLF*HV*PAYFQPSK 

NKTAKINCN*RPFLFLVCYLL*AELHIGIFIANF 

YDCIPNKIJIEHLWPKLLQSLIFHVDFCGFLHK 

VFYiaTEFLLFLYFL*LFIIKVSCSn*CSTICVF 

SYKSFAVnFFVDNTRITSFGF 


629 


1979 


A 


4660 


18 


999 


HHELHTLELLQNPKEVLTRSEIQDVNYSLEAV 
K^^TVCQIPLMKE^ILKRFQVAVNLAEDTAH 

AGWRNPQKTAFVERFQHLSCVLCaCNVFTSG 
KHYWEVESRDSLEVAVGVCREDVMGITDRS 
KMSPDVGIWArYWSAAGYWPUGFPGTPTQQ 

TFSCSSVSRLKPFFWLSPLASLVIPPVTDRK*G 

FSSPDQNSFPWQLRDTHPWALFCPSCXYPG 

WSIFWVSLTVPFGICPLCASQEAVPWEVGLA 


630 


1980 


A 


4669 


2 


358 


FFFFFKrESHSVAQAGMQWKNLOSLPAPPPGF 
TPFFCLSLLNGWDYRRPPPHLANFFVLLVETG 
FHDVGODGLDLLTS*STPSASOSAFrrnVSHr 
TRUCKIRFAKGHVEFFFESHVE 


631 


1981 


A 


4674 


953 


614 


TPIRGTDDEHEECTVQEYSAGKNTCLRPGAV 
AHTCNPCTLGGRGRWIT*GSGVQDQPGPTWQ 
NPVFLERRPRALHSSPGLTTQRILWAQGLWV 
GAGSTGCSRGPRaEGVFREG 


632 


1982 


A 


4678 


34 


314 


RSIHASGMXSPSFOTMGHLUUJEFEILPSTPNP 

♦LPSYQGEAAGSSLISHLQWSPDLKGVYCTFP 

ASGLAPVPTHWTVSELSRSPVATATFC 


633 


1983 


A 


4696 


1 


1365 


RTLGMEGERRASQAPSSGLPAGOANOESPGG 

GAPFPGSSGSSAIXQAEVLDU5EDEDDLEVFS 

KDASLMDMNSFSPMMPTSPLSMINQIKFEDEP 

DLKDLPrrVDEraSHVlTlH'lVrryHnTKTSRG 

EFDSSEFEVRRRYQDFLWLKGKLEEAHPTUI 

PPIPEKFIVKGMVERFNDDFIETRRKALHKFL 

NRIADHPTLTFNEDFKIFLTAQAWELSSHKKQ 

GPGLLSRMGQTVRAVASSMRGYKNRPBEFM 

EMNNFIELFSQKlNLIDKISQRrYXEEREyFDE 

MKEYGPimLWSASEEDLVmnLKDVASCIDRC 

CKATEKRMSGLSEALLPWHEYVLYSEMLM 

GVMKRRDQIQAELDSKVEVLTYKKADITJLL 

PEEIGKm)KVECANNALKADWERWKQNM 

QNDIKLAFTDMAEENIHYYEQCLATWESFLT 

SQTNLHLEEASEDKP 


634 


1984 


A 


4708 


421 


158 


SYWVGEDYTYKFFEVILIDPFHKAIRRNPDTQ 
WISKAVYKHREMCGLTSTGRKSHGLEKDRM 
FPHAIGGSCRAA*RRRKTLQFPCYH 


635


1985 


A 


4709 


42 


341 


YIKQPDAKERRRTVHWKKKlliSEASEinPPST 
PGVPQAPGHWEDYGRGDNFYLPH*DPGGIVL 
WNIFNRMPIARKNITDGEHHEYLIEVPRLFHT 
SED 


636 


1986 


A 


4721 


2 


351 


EKPDHFFPEGTSFIHEPRRPN*GDLVHCLGG1S 
RSriVrVA*LMQKLNLSKfl«>AYYIVIMKMSS 
ISPNFNSMDQPLDFQRHX3LRSPCYNRVPAQK 
MYFTIPSNHNAYQVDSVQST 


637 


1987 


A 


4726 


664 


253 


NTGLTCSIQRKCXJETQLyKREENRLILLLQDH 
LKSESFQVLTLSPRLEFSGLISAHCNLRLPGSS 
DSSASSSRAAGITGVHHHAWLIFFFLVETGFL 
HAG*AGLELLTSGDPPASASRSAGITGVSHHA 
RPRETRFL 


638 


1988 


A 


4734 


24 


592 


GGMDSRVSGTTSNGETKPVYPVMEKKEEDG 

TLERGHWNNKMEFVLSVAGEHGLGNVWRFP 

YUrmJGGGAFFIPYLVFLFTCGlFVFLLETAL 

GQYTSQGGVTAWRKICPIFEGXGYASQMIVIL 

LNVYYUVLAWALFYLFSSFTIDLPWGGCYHE 

WNTEHCMEFQKTNGSLNGTffiNATSPVIEFW 


639 


1989 


A 


4743 


1040 


699 


QGLTLLPRMECSATITAHCSLELPGSIDLPTSA 
S*VARTTGTHHHPWLILVLLL»TWGSYYVAQ 
AGUSLLGSSNLPAAMVSQSAQnGHDHCAWA 
TSNHVLYTQEGLRRGKEO 


640 


1990 


A 


4771 


527 


2 


GRIDCPHPAlViAQPll-lDACSVLCiAYQGAQN 

WIRIUUH3J>SGCIJCMNREIGPLQHSLCCPGWS 

QTPOLKAILLRQPPK*LOLQMESHSCPPAWSA 

MARSRLTATSASQVQAILLPQPPGTroSCSPS 

PDHEQQPLSWVLPPPQKDMNPREQQVALGP 

QAAALPWAVWRNDCFPR 


641 


1991 


A 


4780 


1^ 


473 


RPSSQCGGIPTGWKKGLAPELSSELSSPPLPAR 

LQLAASPYFSPSWAECPQPVPAGTHATWCLA 

RVWARMTPPGPAGIPSHPLPPPPPERSVPIPSP 

FPARDSGSRQGHSTDRYKHTDAPRDAHRRVP 

QRDTDTGVHTGSGTHTHAHTPPEIC 


642 


1992 


A 


4798 


1 


487 


GYSFRCDIVDYSRSPTALRMARTCWLYYFSK 
FIEIXIXnFFVLRKKNSQVTFLHVFHHTIMPW 
TWWFOVKFAAGGLGTFHALLNTAVHWMY 
SYYGLSALGPAYQKYLWWKKYLTSLQLVQF 
VIVAIHISQFFFMEDCKYQFPVFACIIMSYSFM 
FLLLFLH 


643 


1993 


A 


4799 


2 


391 


LMAFIEMHISGSLVYLKIKTICIYSYFSMLNFLL 

QEIPLSEILRlSSPRDFrNISQGSNPHCFEIITDT 

MVYFVGENNGDSSHNPVLAATGVGLDVAQS 

WEKAIRQALMPVTPQASVCTSPGQGKDHSK 

Q»ASVCrSPGQGKDHSKQ 


644 


1994 


A 


4800 


488 


101 


AYPLFAVHPVHTECVAGWGRAYLLCALFFL 
LSFLGYtaCAFRESNKEGAHSSTFWVLLSIFLG 
AVAMUXEQGITVLVRAATWLOPAFSVCTFP 
SYKDIWOWPCLCXjVLHAYIPLLV 


645


1995 


A 


4805 


458 


126 


IXWTTVIXIQTOARPQSTMIHLGHILFLLLLPV 
AAAQTIPGERSSLPAFYPOTSGSCSGCGSLSL 
PLLAGLVAADAVASLUVGAVFLCARPRRSP 
AQEDGKVYINMPORQ 


646 


1996 


A 


4817 


47 


1033 


LQGDTWHLSFLSHFSRLHGGVPGRGLLEGNL 

LQPQAPGHDMTSIPFPGDRLLQVDGVILCGLT 

HKQAVQCLKGPGQVARLVLBRRVPRSTQQC 

PSANDSMGDERTAVSLVTALPGRPSSCVSVT 

DGPKF*SSN*KRIANGLGFSFVQMEKESCSHL 

KSDLVRIKRLFPGHPAEENGAIAAGDIILGRE 

WEGPRKASSSRCRGSWAMQLSVQAGPSFAS 

YYPAAVEVLHLLRGAPQEVTUXCRPPPGAL 

PELEQEWQTPaA\DKEFTRATCTDSCTSPIL 

GSRGQLGGTVPPQMQGKAWGLRPESSQKAIR 

EGTMGAKTERDLQPVP 


647 


1997 


A 


4854 


1044 


333 


PRVRGDWPLEKKKSNSNIHPIFSWCGSTDSKD 

IVMPTTDLTDSVLETMGRVSLDMMSVQANT 

GPPWESKNSTAVWRGRDSRKERLELVKLSRK 

HPELroAAFTNFFFFKHDENLYGPrVKmSFFD 

FFKHKYQINIDGTVAAYRLPYLLVGDSWLK 

QDSrYYEHFYNELQPWKHYIPVICSNLSDLLEK 

LKWAKDHDEEAKKIAKAGQEFAKNNLMGD 

DIFCYYFQTFPRNMPIYK 


648 


1998 


A 


4867 


2030 


837 


AGMLPAVGSADEEEDPAEEDCPELVPMETTQ 

SEEEEKSGLGAKIPVTnTGYLGAGKTTLLNYI 

LTEQHSKRVAVILNEFOEGSALEKSLAVSQG 

GBLYEEWLELKNGCLCCSVKDNGLRAIENLM 

QKKGKFDYILLETTGLADPOAVASMFWVDA 

ELGSDIYLDGimVDSKYGLKHLAEEKPDGLl 

NEATRQVALADAILINKTDLVPEEDVKKLRT 

TIRSINGLGQILETQRSRYDLSNVLDLHAFDSL 

SGISI^KJaLQHVFGTQPIILDQSIVTrrFDVPG 

NAKEEHUIMnQNLL^V^KNVRNKDNHCMEV 

IRLKGLVSIKDKSQQVIVQGVHELYDLEETPV 

SWKDDTHITNRLVLLGRNLDKDILKQLFIAT 

VTETEKQWTTHFKEDQVCT 


649 


1999 


A 


4873 


226 


189 


DGVSLLLPKLGVQWAQYWAHWQPPLPGFKR 
FSCLSLRSSWD*KCAPFHPAFVFLVEMOFHRV 
GQAGLBLRTSGDPPASASQSAGrrGVSHLA*P 
TSMPLLPFQRLCVYI 


650 


2000 


A 


4874 


2 


437 


b FFLKKSFAFVAQAGVQWCDLGSPQPLPPGF 
K*FSCLSLPSSWDYRHAPPPCPS*FLYF**RQG 
FTMLARLVLNS*PHDLPTSPSQSAEIKOVSHR 
CPASFYLFLKYYLEAKFCA*OECAPSAGVGA 
GYKRGHKSCLLINCWQI 


651


2001 


A 


4898 


1701 


771 


DAWGPETRLARILNPDSFIH?RPGRLPELEATR 

PHMEPKASCPAAAPLMERKFHVLVGVTGSV 

AAUCLPLLVSKLLDIPGLEVAVVTTERAKHFY 

SPQDIPVTLYSDADEWEMWKSRSDPVLHIDL 

RRWADLLLVAPIJ)ANTLGKVASGICDNLLTC 

VMRAWDRSKPIXFCPAMNTAMWEHPITAQQ 

VDQLKAFGYVEIPCVAKKLVCXiDEGLGAMA 

EVGnVDKVKEVLFQHSGFQQS*PGISVMGVP 

LYSEWVQAKSVKMDVGKIGGYPHLLNGGPA 

LSLPRGQACSRLNWTEGPOLSFFQPGEAAA 


652 


2002 


A 


4927 


1 


611 


FRGRQTSRPARGFSPWBPPGTMQEPSSGECPA 

SP*U>CASNRLAFGGLIFPCAPLVPYPAPFSPLL 

PAFSCAPRPRAHTHSRTHPSAPLVPKPSSRAR 

GQSPIPSRASSPSCSWAQVPGVALARCAGVC 

KPGDSWRVAAaSQRCCaiGRRRGSGPRNPE 

QSFRGAWOPSFWGSWKSQRELSAGGAQAWP 

LLGSAGSGLRGEA


653


2003


A 


4965 


2 


283 


FFJ-FJ'UOVSLCHPGWNAVARSWLTATSASR 
VQAVSCFRLPSSWDYRHATMPG*FF»YF**R 
WOFTaAILVLNS*PQVICPPWPPKVLTLQA 


654 


2004 


A 


4968 


3 


437 


RPGIPGRRFRRSWFCQLP*EPEPGLESLATPGD 
IPAVGLGALGVIPPVRVPQRPPTQRSQGRGW 
DPERDPGCRVOVSlKiPllTJfiKnKTPOI OnCTJP 

PPCLTHLAAASCWVWCGRWKRDSAECQCD 
HSCSAVSQQEDRCRSSSCS 


655 


2005 


A 


4983 


201 


397 


NlNNNTTCIQPSMISSMALPnYILLCIVGVFGN 
TLSQWIFLTKIGKKTSTHIYLSHLVTANLLVC 


656 


2006 


A 


4988 


332 


159 


LVHKDMYREFFEEEAQASNKHVTRCLTSLVI 
REVHKTMR'HFLPIRLEKNKNNIKD 


657 


2007 


B 


5008 


129 


465 


MAGMKTASGDYIDSSWELRVFVGEEDPEAES 

VTLRVTGESHIGGVLLKlVEQI>aUCQDWSDH 

AIWWEQKRQWLLQTHWTLDKYGILADARLF 

FGPQHRPVILRLPNRRALRLX* 


658


2008 


A 


5017 


1 


292 


Fm'KETESHSVTQAOVQWHDLGSLQPPPPCT 
KRFSCLSLLSSWDYRCAPPHPANFVFLVETGF 
HHVAQAGLKIXTL*SANLGLSTSLP1PLFILLS 


659 


2009 


A 


5018 


17 


338 


RGHGGKSLTGGTPGNWGDGLLVSEDWSHLIF 
T*NSLVSPVLOKWSPCLQOPOLSAVHTWPWL 
1V1\ACWAVHVKTHMRPGLAVLPRLVLNSWS 
♦AHLLWPPKALGLQA 


660 


2010 


A 


5028 


2 


310 


SRVDDFVGERRGGCDBCLCGHRGLRAVPLG 
HPGHLCLQPPGGPA*FLDYCRGCCPHPVPGST 
AGSCPRQKKTTPGPTVLCVCSFWIYQRGEPH 
HRTGARWNH 


661 


2011 


A 


5050 


752 


431 


RQSCSSTQAKVQWFHYGPLQSQPPGLKQSSQ 
LSLPNSRDHRHVPPRLAIFSFAETGSPYFAQAS 
LELLGSSHPPTSASQSARITGVSHRAWPLK*F 
NLNQYQTLTMN 


662 


2012 


A 


5054 


48 


103 


ELNNGPFQMPLCNGGNLAVTGSWADRSPLH 

EAASQGRIXALRTLLSQGYNVNAVTLDHVTP 

LHEACLGDHVACARTLLEAGANVNArriDGV 

TPLFNACSQGSPSCAELLLEYGAQAQLESCLP 

SPTHEGASKGHHECLDILISWGIDVDQEIPHSG 

TPLYVACMAQQFHCIWNLIYAQAGVRKGKY 

WDTPLPGAGHQSTQKLE*LFAMVEIWQ 


663 


2013 


A 


5066 


951 


580 


VKNS»SFAHCASVYKHHYMDGQTPCLFVSSK 
ADLPEGVAVSGPSPAEFCRKHRLPAPVPFSCA 
GPAEPSTTIFTQLATMAAFPHLVHAELHPSSF 
WLRGIXGWGAAVAAVLSFSLYRVLVKSQ 


664 


2014 


A 


5071 


550 


1 


LSFEEVLSMEQVNKTVYREFWLQFSSLARLQ 

QLLFVIFLLLYLFTLGTNAinSTTVLDRALHTP 

MYFFLAILSCSEICYTFVTVPKNILVDLLSQKK 

nSFLGCAIQMFSFLFFGSSHSFLLAAMGYDR 

YMAICNPLRYSVLMGHGVCMGLMAAAWAC 

GFTVSLVTTSLVFHLPFHSSNQHE 


665 


2015 


A 


5074 


496 


692 


QQYHNTGSAGHHAHCQVGHSPHVHYPSGCG 

PL*IQRGLPSFNSLEGHSLBa)SGHEESVQLDSE 

HDVQRSLYCDTAVNDVLNTSVTSMGSQMPD 

HDQNEGFHCREECRILGHSDRCWMPRNPMPI 

RSKSPEHVRNIIALSIEATAADVEAyDDCOPT 

KRTFATFQKDVSDHPAEERPTLKGKRTVDVT 

ICSPKVNSVIREAGNGCEAISPVTSPLHLKSSL 

PTKPSVSYEIVDPGITARRC 


666 


2016 


A 


5080 


408 


248 


IMLLSTSS'VYFQSSTKDSHFFLFDFQKTGPPL 
VGPKAQLSGLQLQPCLYKRR 


667 


2017 


A 


5081 


129 


247 


DLTNSHFFLFDFQKTGPPLGGPKAQFSSLQLQ 
PCVY*RR 


668 


2018 


A 


5086 


852 


233 


NUCSNDRWVQIKTAYKYFPKNGDNYNWVF 

RALPTTFADIEhlLKYLLFTRDASQPFYLGHTV 

IFGDLEYVTVEGGIVLSRELMKRLNRLLDNSE 

TCADQSVTWKLSEDKQLAICLKYAGVHAENA 

liLI 1 JKuKUVrrf 1 KJ'lAIJlJliliALoNNFQQVVEG 

CCSDMAITFNGLTPQKMEVMMYGLYRLRAF 

GHYFNDTLVFLPPVGSEND 


669 


2019 


A 


5101 


1 


329 


PGRPTRPPIX,TLLAHVSPEPAGPSCDSLAQPO 
ASOy*VQHDSHPPLLCGSQCXSEPVPGSHGPP 
RGCQHEAAPCPRGPGSDGLHHASAACASLPP 
SPILPVUPELGPL 


670 


2020 


A 


5102 


3 


547 


DAWGNRCAVGAAPRLIHLHlXXTPADPSRkP 

DEL*NMNGRVDYLVlEliIlINLTRGPSGLGFNl' ' 

VGGTDQQYVSNDSGIYVSRDCENGAAALDGR 

LQEGDKILSVNGQDLKNLLHQDAVDLFRNA 

GYAVSLRVQHRLQVQNGPIGHRGEGDPSGIPI 

FMVLVPVFALTMVAAWAFMRYRQQL 


671 


2021 


A 


5105


672 


400 


RDGREELCLQQEPTLPSRICSSAPLLYFLFICPF 
VllIJLIXISUXXYWKAIUCLSrnJlSNrRKBCA 
LWVDLKEAGGVTTORMED*EEDECN 


672 


2022 


A 


5148 


72 


314 


nVFSYNU-UUl'ELLNUVHaJCQALNGLSQLT 
YTSGNPTKRQSQLIDTLQHQVKSUEQQLAVS 
NQAHGALQEYVLAPCS 


673 


2023 


A 


5152 


210 


335 


REIL(:SWGRLNW*MSLFPNLTC3a,NAIPIKIPA 
NHFVEVT 


674 


2024 


A 


5153 


3 


2953 


LTEDQPFDILQKSLQEANITEQTLAEEAYLDA 

SIGSSQQFAQAQLHPSSSASFTQASNVSNYSG 

QTLQPIGVTHVPVGASFASNTVGVQHGFMQH 

VGISVPSQHLSNSSQISGSGQIQUGSFGNHPS 

MMTINNLDGSQIILKGSGQQAPSNVSGGLLV 

HRQTPNGNSLFGNSSSSPVAQPVTVPFNSTNF 

QTSLPVHNHIQRGLAPNSNKVPINIQPKPIQM 

GQQNTYNVNNLGIQQHHVQQGISFASASSPQ 

GSWGPHMSVNTVNQQNTRKPVTSQAVSSTG 

GSIVIHSPMGQPHAPQSQFLIPTSLSVSSNSVH 

HVQTINGQLLQTQPSQLISGQVASEHVNiLNR 

NSSNMLRTNQPYTGPMLNNQNTAVHLVSGQ 

TFAASGSPVIANHASPQLVGGQMPLQQASPT 

VLHLSPGQSSVSQGRPGFATMPSVTSMSGPSR 

FPAVSSASTAHPSLGSAVQSGSSGSNFTGDQL 

TQPNRTPVPVSVSHRLPVSSSKSTSTFSNTPGT 

GTQQQFFCQAQKKCLNQTSPISAPKTTDGLR 

QAQIPGLLSTTLPGQDSGSKVISASLGTAQPQ 

QEKWOSSPGHPAVQVESHSOGQKRPAAKQ 

LTKGAFILQQLQRDQAHTVTPDKSHFRSLSD 

AVQKLLSYHVCQQSMPTEEDLRKVDNEFETV 

ATQIXKRTQAMUOCYRCLLLEDAlvIRINPPAE 

MVMIDRMFNQEERASLSRDKRLALVDPEGFQ 

ADFCCSFKLDKAAHETQFGRSDQHGSKASSS 

LQPPAKAQGRDRAKTGVTEPMNHDQFHLVP 

NmWSAEGNISKKTECLGRALKFDKVGLVQ 

YQSTSEEKASRREPLKASQCSPGPEGHRKTSS 

RSDHGTESKLSSILADSHLEMTCNNSFQDKSL 

KNSPKNEVLHTOIMKGSGEPQPDLQLTKSLET 

TFKNILELKKAGRQPQSDPTVSGSVELDFPNF 

SPMASQBNCLEKFIPDHSEOWETDSILEAAV 

NSDUBC 


675 


2025 


A 


5154 


599 


1880 


LKKMEPFSC01PVALPPATVDNRIIFGKNSDR 

LYDEVQEWYFPAWHDNUjERLKCTYIEID 

QVPETYAWLSRPAWLWGAEMGANEHGVa 

GNEAVWOREEVCDEEALLGNdDLVRLGLERA 

DTAEKALNVIVDLLEKYGQGGNCTEGRMVF 

SYHNSFIJADRNEAWILETAGKYWAAEKVQE 

GVKNlSNQI^ll IKiAREHPDMRNYAKRKGW 

WDGKKEFDFAAAYSYLDTAKMMTSSGRYCE 

GYKLLNKHKGNTTFETMMEILRDKPSGmME 

OEFLTTASMVFILPQDSSLPCIHFFTGTPDPER 

SVFKPFIFVPHISQUDTSSPTFELEDLVKKKS 

HFKPDRBHPLYQKHQQALEVVNNNEEKAKI 

MLDNMRKLEHELFREMESILQNKHLDVEKIV 

NLFPQCTKDEIQIYQSNLSVKVSS 


676 


2026 


A 


5155 


2 


306 


l-l-FLKRSLALSPRPDCGLQWKNLGSJLQAPPPG 
FTPFSCLSLPSSWDYRRPPPRPANFLYF**RRG 
FnXARMVSIS*PHDPPASASQSAGrrGVSHRA 
RPT 


677 


2027 


A 


5167 


97 


740 


FFHSVDLLALEQSK.TPYKPDWFDIVESEVKCC 

KEAVCVIDMSSFTEFErrSTGDQALEVLQYLF 

SNDLDVPVGHIVHTGMLNEGGGYENDCSIAR 

LNKRSFFMISPTDQQVHCWAWLKKHMPKDS 

NLLLEDVTWKYTALNLIGPRAVDVLSELSYA 

PMTPDHFPSLFCXEMSVGYANGIRVMSMrHT 

GEPGFMLYlPlEYRWGFTMLSn-VSNS 


678 


2028 


A 


5183 


1919 


2018 


PALCRLRDDMTVCVADFGLSKKIYSGDYYRQ 
GRIAKMPVKWIAIESLADRVYTSKSDVWAFG 

VT^^WEIATRG^f^>YPGVQ^a^E^lYDYLLHO 

HRLKQPEDCLDELCKI**SPQSP 


679 


2029 


A 


5190 


39 


499 


RESQVKHFKMRKIDLCLSSEGSEVILATSSDE 
KHPPENIIIXjNPETFWTTTGMFPQEFnCFHKH 
VRIERLVIQSYFVQTLKIEKSTSKEPVDFEQWI 
EKDLVHTEGQLQNEEIVAHDGSATYLRFnVS 
AFDHFASVHSVSAEGTWSNLSS 


680 


2030 


A 


5204 


541 


92 


EILAVLKLACGDISLNALALMVATAVLTLAPL 

LLICLSYLFILSAILRVPSAAGRCKAFSTCSAH 

RTWWFYGTISFMYFKPKAKDPNVDKTVAL 

FYGWTPSLNPIIYSLRNAEVKAAVLTLLRGG 

LLSRKASHCYCCPLPLSAGIG 


681 


2031 


A 


5207 


10 


247 


VPDNGDVTfaPVCSTLVEETSLTVSEAMEQSl 
KNESPLPGTLAHrCNTSTLGGRGRWIT*GREF 
DTSMANMVKPCLYRK


682 


2032 


A 


5210 


2 


231 


FPFETESYSITQAGVQWPNLSSLKTLPPGFK»F 

SCLSLPSSWDYRCLPPCPAMFCIFSRNGVLPC 

WPOWSRTTOLS 


683 


2033 


A 


5218 


85 


402 


CPSVSGLIKSDLRRHNINIG]TNVDVKAVS^aF 
MIlLLRSMYRlNVKPYFFI*LFFSRVNC»SVnG 
YARCYrFLIF*LFL*IPADSPTDQEPKTVMLSK 
QSESAI 


684 


2034 


A 


5220 


1 


194 


NLMKEMQNLNSENHKTWEEYKDTK*IMSYF 
YG»ALNVIKMAVIJTCmYRFSATLVKIPQHL 
TDS 


685 


2035 


A 


5228 


260 


440 


LHSQDGNSDPRKPQGEMSAHAFPVQTCGEED 
QKKTPQVPINFTELSKCS*S*KIMSGERB 


686 


2036 


A 


5239 


79 


508 


GQEAAARAAKLSSPRPHRVGRRERGVGGMS 
AFSEAALEKKLSELSNSQQSVQTLSLWLIHHR 
KHSIU'IVTVWEKELRKAKPNRKLTFLYLAND 
VIQNSKRKGPEFIXDFAPVIVEAFKHVSSETD 
ESCKKHLGRVLSIWEERS 


687 


2037 


A 


5244 


1 


428 


MAAWAATALKGRGARNARVLRGILAGATA 
NKASHNRTRALQSHSSPEGKEEPEPLSPELBYI 
PRKRGKNPMKAVGLAWAIGFPCGH-LFILTKR 
EVDKDRVKQMKARQNMRLSNTGEYESQRFR 
ASSQSAPSPDVGSOVQT 


688 


2038 


A 


5249 


1 


1407 


LQQTEDKSLLNQGSSSEEVAGSSQKMGQPGP 

SGDSDLATALHRLSLRRQNYLSEKQFFAEEW 

QRKIQVLAI)QKEGVSGCVTPTESLASLCTTQS 

EITDLSSASCLRGFMPEKLQIVKPLEGSQTLy 

HWQQLAQPNLGTILDPRPGVnXGFTQLPGD 

AmnSDLEEDEEEOriTQVQQPLEVffiKLSTS 

KPVTGIFLPPITSAGGPVTVATANPGKCLSCr 

NS'lKl K1TCR1LHPSDITQVTPSSGFPSLSCX3SS 

GSSSSNTAVNSPALAYRLSlGESimRRDSTTT 














fsstmslakllqergisakvyhspisenplqmT 

pkslaipsttpnspshspcpsplpfeprvhlsen 

flasrpaeiflqemyglrpsrnppdvgqlkm 

nlvdrlkrlgiarwknpgaqengrcqeaei 

gpqkpdsavylnsossllgglrrnqslpvim 

gsfaapvctsspkmovlked 


689 


2039 


A 


5254 


2 


2621 


LSLFGSRALGRSGARAMAKAKKVGARRKAS 

GAPAGARGGPAKANSNPFEVKVNRQKFQnXS 

RKTRHDVGLPGVSRARALRKRTQTIXKEYKE 

RDKSNVF1U)KRFGEYNSNMSPEEKMMKRPA 

LEQQRHHEKKSXYNLNEDEELTHYGQSLADIE 

KHNDIVDSDSDAEDRGTXSGELTAAHFGGGG 

GLLHKKTQQEGEEREKPKSRKELIEEHAKSK 

QEKRERQAQREDALELTEKLDQDWKEIQTLL 

SHKTPKSENRDKKEKPKPDAYDMMVRELOF 

EMKAQPSNRMKTEAELAKEEQEHLRKLEAE 

RLRRMLGRDEDENVKKPKHMSADDLNDGFV 

LDKDDRRLLSYKDGKMNVEEDVQEEQSKEA 

SDPESNEEEGDSSGGEDTEESDSPDSHLDLBS 

NVESEEENEKPAKEQRQrrPGKGLISGKERAG 

KATRDELPVTFAAPESYEELRSLLLGRSMEEQ 

LLWERIQKCNHPSLAEGNKAKLEKLFGFLLE 

YVGDLATDDPPDLTVIDKLWHLYHLCQMFP 

ESASDAIKFVLRDAMHEMEEMIETKGRAALP 

GLDVLfYLKlTGLLFPTSDFWHPWTPALVCL 

SQLLTKCPILSLQDWKGLFVCCLFLEYVALS 

QRFIPELINFLLGILYIATPNKASQGSTLVHPFR 

ALGKNSELLWSAREDVATWQQSSLSLRWA 

SRLRAPTSTEANHIRLSCLAVGLALLKRCVLM 

YOSLPSFHAIMGPLRALLTDHLADCSHPQELQ 

ELCQSTXTEMESQKQLCRPLTCEKSKFVPLKL 

FTPRLVKVLEFGRKQGSSKEEQERKRLIHKHK 

REFKGAVREIRKDNQFLARMQLSEIMERDAE 

RK31KVKQU!NSLATQEGEWKALKRKKFKK 


690 


2040 


A 


5261 


1 


304 


I-PFFVFLVKIXJFHHVGQAGLELLTSGDPPTW 
ASQSAOnXiVSHCSWPVIYVLSTLLHAVRNVL 
FKRTFPLKSSSFLSYDKEIFPIUVLKFYLVTLT 
SFVK 


691 


2041 


A 


5270 


3 


158 


NtJH 1 THCTANWVHLPGTPPGWKIDOPAAAL 
EVLSSFFFFFLKFSYKPQNTV 


692 


2042 


A 


5282 


56 


1268 


OMEPVGCCGECRGSSVDPRSTFVLSNLABW 

ERVLTF1J>AKAIJLRVACVCRLWRECVRRVLR 

THRSVTWISAGLAEAGHLEGHCLVRWAEEL 

ENVRILPHTVLYMADSETFISLEECRGHKRAR 

KRTSMETALALEKLFPKQCQVLQIVTPGIWT 

PMGSGSNRPQEIEIGESGFALLFPQIEGIKIQPF 

HFKDPKNLTLERHQLTEVGLLDNPELRWLV 

FGYNCCKVGASJm-QQWSTTSDMNIILAGG 

QVDNLSSLTSEKNPLDIDASGWGLSFSGHRI 

QSATVIXNEDVSDEKTAEAAMQRLKAANIPE 

HNTIGFMFACVGRGFQYYRAKGNVEADAFR 

KFFPSVPLFGFFGNGEIGCDRIVTGNFIUIKCN 

EVKDDDLFHSYTTIMALIHLGSSK 


693 


2043 


A 


5301 


362 


507 


EEIKERFGPGLVIYWYGFIQELDCNRERGILLK 
ACFPTNIVTLCHSIA 


694 
695


2044 
2045 


A 
A 


5310 
5315 


1 

125


204
1596


RVLTAINHTLKENLRKFYKGKKDKPLDLRPK 
KTRAMRRRLNMHEENLKTKKQHRKERLYPL 
RKYAAKA 

b-lKiriAVKSEVQVCISU.LCLH)RTMPKKAKP 














TGSGKEEGPAPCKQMKLEAAGGPSALNFDSP 

SSLFESLISPKTETFFKEFWEQKPLLIQRDDPA 

LATYYGSLFKLTDLKSLCSRGMYYGRDVNV 

CRCVNGKKKVLNKDGKAHFLQLRKDFDQKR 

ATIQFHQPQRFKDELWRIQEKLECYFGSLVGS 

NVYTTPAGSQGLPPHYDDVEVFILQLEGEKH 

WRLYHPTVPLAREYSVEAEERIGRPVHEFML 

KPGDLLYFPRGTIHQADTPAGLAHSTHVTIST 

YQNNSWGDFLLDTISGLVFDTAKEDVELRTG 

IPRQIJXQVESTTVATRRLSGFLRTLADRLEO 

TKELLSSDMKKDFIMHRU'PYSAODOAELSTP 

GGKLPRLDSVVRLQFKDHIVLTVLPDQDQSD 

ETQEKMVYIYHSLKNSRETHMMGNEEETEFH 

GLRFPLSHLDALKQIWNSPAISVKDLKLTTDE 

EKESLVLSLWTECLIQW 


696 


2046 


A 


5318 


1476 


742 


LMKXYLEAAELGEISDnnKLLRLSSSQGTIET 

SLQDIDSRLSPGGSLADAWAHQEGTHPKDRN 

VEKIXJVIXNCavITEIYYQFKKDKAERRLAYN 

EEQmKFDKQKLYYHATKAMTHFTDECVKK 

YEAFLNKSEEWIRKMLHLRKQLLSLTNQCFDI 

EEEVSKYQEYTNELQETLPQKMFTASSGIKHT 

MIPIYPSSNTLVEMTLGMKKLKEEMEGVVKE 

LAENNHDLESGGSLTMDGGLRNVDCL 


697 


2047 


A 


5320 


244 


478 


LDYNFfLFHvflTGLVSQAGVQWHDLGSLQPP 
PPGFKQFSCLSLPSSWDYRHLPPHLANFSREG 
VSPSWPGWSRTPDFR 


698 


2048 


A 


5324 


266 


714 


LPIRKSLRSVRSGFPTSQSPrrRNLDOTASGSC 

LAKTVTGSLFRINVGLRGLVAGGUQALLGTP 

VGGLLMAFQKYSGETVQERKQKDRKALHEL 

KLEEWKGRLQVUEHLPEKIESSLQEDEPENDA 

KHEALLNLPRNPSVIDKQDKD 


699 


2049 


A 


5334 


699 


277 


RPHGHLVCISSSAOLSGVNGLADYCASKFAA 

FOFAESVFVETFVQKQKGIKTTIVCPFFIKTGM 

FEGCTTGCPSLLPILEPKYAVEKIVEAILQEKM 

YLYMPKLLYFMMFLKSFLPLKTGLLIADYLGI 

LHAMDGFADQKK 


700 


2050 


A 


5344 


3 


614 


PTAEEMSSLTPESSPELAKRSWFGNFrSU)KEE 

QIFLVLKDKPLSSKADIVHAFLSIPSLSHSVLS 

QTSFRAEYKASGGPSVFQKPVRFQVDISSSEG 

PEPSPRRDGSGGGGIYSVTFnJSGPSRKFKRV 

VETIQAQIXSTHDQPSVQALADEKKGAQTRP 

AGAPPRSLQPPPGRPDPELSSSPRRGPPKDKK 

LLATNGTPL 


701 


2051 


A 


5346 


3 


1383 


HASVLFCRVMAASJOXKjAVARMQEDRDGSC 

STVGOVGYGDSKDCILEPLSLPESPGGnTLE 

GSPSVPOFCEEHFPVAEQDKLLKHMIIEHKIV 

lADVKLVADFQRYILYWRKRFTEQPrrDFCSV 

ERINSTAPFEEQENYFLLCDVLPEDRILREELQ 

KQRLREILEQQQQERNDTNFHQVCMFCNEEF 

LGNRSVILNHMAREHAFNIGLPDNIVNCNEFL 

CnXJKKLDNLQCLYCEKTFRDKNTLKDHMR 

KKQHRKINPKNREYDRFYVINYLELGKSWEE 

VQLmJDRELLDHQEDDWSDWEEHPASAVCL 

FCEKQAETIEKLYVHMEDAHEFDLLKIKSELG 

LNFYQQVKLVNFIRRQVHQC31CYGCHVKFKS 

KADUITHMEETKHTSLLPDRKTWEK2LEYYFP 

TYENDTIXWTLSDSESDLTAQEQNENVPIISE 

DTSKLYALKQSSILNQLLL 


702 


2052 


A 


5356 


2502 


1540 


MAAATRGCRPWGSLLGLLGLVSAAAAAWD 






WO 01/57188



PCT/US01/03800



SEQ ID NO: of nucleotide sequence


SEQ ID NO: of peptide sequence


Method


SEQ ID NO: in USSN 09/496,914


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence


Amino acid sequence (A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine, K=Lysine, L=Leucine, M=Methionine, N=Asparagine, P=Proline, Q=Glutamine, R=Arginine, S=Serine, T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop codon, /=possible nucleotide deletion, \=possible nucleotide insertion)




LASLRCILGAFCECDFRPDLPGLECDLAQHL 

AGQHLAKALWKALKAFVRDPAPTKPLVLSL 

HGWTGTGKSYVSSLLAHYLFQGGLRSPRVH 

HFSPVLHFPHPSHDERYKKDLKSWVQGNLTA 

CGRSLFLFDmDKMPPGLMEVLRPFLGSSWV 

VYGT^mUCAIFIFISNTGGEQINQVALEAWRS 

RRDREEILLQELEPVISRAVLDNPHHGFSNSGI 

MEERLLDAWPFLPLQRHHVRHCVLNELAQL 

GLEPRDEWQAVLDSTTFFPEDEQLFSSNGCK 

TVASRIAFFL 


703 


2053 


A 


5380 


278 


657 


LFLQKU^MKTEEEARTHTEIEl^IFLRKEQQKL 

EERLEFWMBKYDKDTEMKQNELNALKATKA 

SDLAHLQDLAKMmEYEQVnEDRDaOERSKK 

KVKQDLLELKSVIKLQAWWROTMIRREIOGF 

KM 


704 


2054 


A 


5381 


1 


1003 


FRGRAVKMAAWEVEVGGGAAGERELDEV 

DMSDLSPEEQWRVEHARMHAKHRGHEAMH 

AEMVLILIATLWAQLLLVQWKQRHPRSYN 

MVTLFQMWVVPLYPTVKLHWWRFLVIWILF 

SAVTAFVTFRATRKPLVQTTPRLVYKWFLLIY 

KISYATGIVGYMAVMFTLFGLNLLFKIKPEDA 

MDFGISUJYGLYYGVLERDFAEMCADYMA 

STIGFYSESGMPTKHLSDSVCAVCGQQIFVDV 

SEEGIIENTYRLSCNHVKHEFCIRGWCIVGKK 

QTCPYCKEKVDLKRMFSNPWERPHVMYGQL 

LDWLRYLVAWQPVnGWQGINYILGLE 


705 


2055 


A 


5396 


3 


675 


lYDRDPLQLATRAGQPLDINMAGEPKPYRPKP 

GNKRPLSALYRLESKEPFLSVGGYVFDYDYY 

RDDFYNRLFDYHGRVPPPPRAVIPLKRPRVA 

VTTTRRGKGVFSMKGGSRSTASGSTGSKLKS 

DELQTKKELTQIKTKIDSVLGRLDKIEKQQK 

AEAEAQKKLLEESLVLIQEECVSEIADHSTEEP 

AEGGPDADGEEMTDGIEEAFDEDGGHELFLQ 

K 


706 


2056 


A 


5410 


2 


98 


GRVGLNLEGRGCSBPKWRHC'JKlWAreQDSI 
S 


707 


2057 


A 


5415 


6 


287 


PFKLTPSFLSHAFSSGQBRKVFIELNHIKKCNT 
VRGVFVLEH?GNYTILLLGLDSHGSNSNLGAP 
EEGLGAGRKRTSVEKSGGAGVTRiOaiDP 


708 


2058 


A 


5423 


3 


291 


SSSNPLGSPSTLWKLCSFVLHNKSCCCSFFGS 

TPTUIAITLTVRVCGFIPEVSKTTNPLGRTNNS 

GCTIFKTVTLTARSTASLIJCSVRPRTHQKE 


709 


2059 


A 


5424 


679 


347 


RIRHEEKRGSRGRGRRTSEEDTPKKKKHK.GG 
SEFTOTILSVHPSDVUJMFVDPNEPTYCLCHQ 
. VSYGH^GCDNPDCTBEWFHFACVDLTTKPK 
GKWFCPRCVQEKRKKK 


710 


2060 


A 


5442 


1073 


539 


QESLKKKIQPKLSLTLSSSVSRGNVSTPPRHSS 
GSLTPPVTPPrrPSSSFRSSTPTGSEYDEEEVDY 
EESDSDESWTTESAISSEAILSSMCMNGGEEK 
PFACPVPGCKKRYKNVNGIKYHAKNGHRTQI 
RVRKPFKCRCGKSYKTAQGLRHHTINFHPPV 
SAEHRKMQQ 


711 


2061 


A 


5449 


1 


319 


QDSLCVPQYNKYREERVILFLKMASGHAFQP 
DLVXRIRDAIRMGLSAIUIVPSLILETKGIPYTL 
NGKKVEVAVKQHAGKAVEQGGAFSNPETLD 
LYRDIPELQGF 


712 


2062 


A 


5499 


91 


749


RPTPQHGDFWMQPLTKDAGMSLSSVTLASAL 
QVRGEALSEEEIWSLUaAAEQLLEDLRNDSS 
DYWCPWSALLSAAGSLSFQGRVSHIE/VAPF 














KAPELLQGQSEDEQPDASQMHVYSLGMTLY 
WSAGFHVPPHQPLQLCEKJISILLTMCEDQPH 
RRCTLQSVLEACRVHEKEVSVYPAPAGLHIR 
RLVGLVLGTISEVSREPCFSSSSCWSCVAIKI 


713 


2063 


A 


5506


22 


478 


VEELILVSRIJ3PHlKrPMYFFLAHLSFLDLSFT 
T^SIPQLLYNU^GCDK'nsyMGCAIQLFLFLGL 
GGVECLLLAVMAYDRCVAICKPLHYMVIMN 
PWXaiOLVSVTWGCGVANSLAMSPVTLRLPR 
CGHHEVDHFLCEMPALIRMAaSTV 


714 


2064 


A 


5514 


25 


220 


AIRPYWCENNUGIGKLSTADGKAFADPEVLR 
RLTSSVSCALDEAAAALTRMRAESTANAGQS 
DK 


715 


2065 


A 


5526 


3 


810 


KVTAPRBPQRYSSGHGSDNSSVLSGELPPAM 

GRTALFHHSGGSSGYBSLRRDSEATGSASSAP 

DSMSESGAASPGARTRSLKSPKKRATGLQRJR 

RLIPAPLPDTTALGRKPSLPGQWVDLPPPLAG 

SLKEPFEKVYEIDDVERLQRPRPTPREAPTQG 

I^CVSTRLRLAERRQQRLRBVQAKHKHLCEE 

LAETQGRLMLEPGRWLEQFEVDPELEPESAE 

YI^VALERATAAIJEQCVNLCKAHVMMVTCFD 

ISVAASAAIPGPQEVDV 


716 


2066 


A 


5529 


458 


790 


SPGYGE>IKFTVTS3CN1AVPLCEMNKIYSYYSD 
SSSSERTMDLVLEMCNTNSmWCGISGRQLG 
KLHPSSSLCLALTLLSSVQOLQSISGLRLTDTF 
LKRTYEYDDIAQVCV 


717 


2067 


A 


5531 


3 


460 


NSEDIXKYFNPESWQEDLDNMYLDTPRYRG 

RSYHDRKSKVDLDRLNDDAKRYSCTPRNYS 

VNIRBELKLANWFFPRCLLVQRCGGNCGCO 

TVNWRSCTCNSGKTVKKYHEVLQFEPGHIKR 

RGRAKTMALVDIQU3HHERCDCICSSRPPR 


718 


2068 


A 


5586


311 


88 


AVLKNMAPMTALGLLDLHILNLILFLSAGEDF 

TSWSEIMMYILLVFLTLWLLIEMIYCYRKVS 

KAEEAAQENA 


719 


2069 


A 


5598 


1 


330 


KNCANEAWQKILDRVLSRYDVRLRPNFGSM 
LATNSTROLNEDELMAHOQEKDSSSESEDSC 
PPSPOCSFTEGFSFDLLNPDYVPKVDKWSRFL 
FPLAFOLFNIVAAERC 


720 


2070 


A 


5628 


798 


148 


IJPAQIPEAWLLLANVVVVLILVPLKDRLIDP " 

LLLRCKLLPSALQKMALGMFFGPTSVIVAGV 

LEMERLHYIHHNETVSQQIGEVLYNAAPLSIW 

WQIPQYLLIGISEIFASIPGLEFAYSEAPRSMQG 

AIMGDFFCLSGVGSLLGSSLVALLSLPGGWLH 

CPKIJFGNINNCRMDLYFFLLAGIQAVTALLF 

VWIAGRYERASQGPASHSRFSRDRG 


721 


2071 


A 


5632 


146 


536 


MSALIVRKLRSAm-TirsmJTVLGANVNAA 

KLHETALHHAAKVKNVDLIEMLIEFGGNIYA 

RDNRGKKPSDYTWSSSAPAKCFEYYEKTPLT 

LSQLCRV^^JRKATGVR<H.EK3AKLNIPPRLID 

YLSYN 


722 


2072 


A 


5638 


3 


3806 


CPSLDIRSEVAELRQLENCSWEGHLQILLMF 

TATGEDFROLSFPRLTQVTDYLLLFRVYGUES 

LRDLFPNLAVIRGTRLrLGYALVIFEMPHLRD 

VALPALGAVLRGAVRVEKNQELCHLSTIDW 

GLLQPAPGANHIVGNKLGEECADVCPGVLGA 

AGEPCAKTTFSGHroYRCWTSSHCQRVCPCP 

HGMACTARGECCHTECLOGCSQPEDPRACV 

ACRHLYFQOACX,WACPPGTYQYESWRCVTA 

ERCASLHSVPGRASTFGIHQGSCIAQCPSGFT 

RNSSSIFCHKCEGLC3«ECKVGTKTIDSIQAA 














QDLVGCTHVEGSULNLRQOYNLEPQLQHSL 

GLVnETTTGFLKIKHSFALVSLGFFKNLKLIRGD 

AMVDGNYTLYVLDNQNLQQLGSWVAAGLTI 

PVGKIYFAFNPRLCLEHIVRLEEVTOTRGRQN 

KAEINPRTNGDRAACQTRTLRFVSNVTEADRI 

LLRWERYEPLEARDLLSFIVYYKESPFQNATE 

HVGPDACGTQSWNIXDVELPLSRTQEPGVTL 

ASLKPWTQYAVFVRArrLTTEEDSPHQGAQS- 

PIVYLRTLPAAPTVPQDVISTSNSSSHLLVRW 

KPPTQRNGNLTYYLVLWQRLAEDODLYLND 

YCMRGLRLPTSNNDPRFDGEDGDPEAEMESD 

CCPCQHPPPGQVLPPLEAQEASFQKKFENFLH 

NAmPISPWKVTSINKSPQRDSGRHRRAAGPL 

RLGGNSSDFEIQEDKVPRERAVLSGLRHFTEY 

RIDIHACNHAAHTVGCSAATFVFARTMPHRE 

ADGIPGKVAWEASSKNSVLLRWI^PDPNOL 

ILKYEIKYRRLGEEATVLCVSRLRYAKFGGV 

HLALLPPGNYSARVRATSLAGNGSWTDSVAF 

YILGPEEEDAGGLHVIXTATPVGLTLLIVLAA 

LGFFYGKKRNRTLYASVNPEYFSASDMYVPD 

EWEVPREQISnRELGQGSFGMVYEGLARGLE 

AOEESTPVAUCTVNELASreBClEFLKEASVM 

KAFKCHHWRLLGWSQGQPTLVIMELMTR 

GDLKSHLRSl-RPEAENNPGLPQPALGEMIQM 

AGEIADGMAYLAANKFVHRDLAARNCMVSQ 

DFTVKIGDFGMTRDVYETDYYRKGGKGLLP 

VRWMAPESLKDGmTHSDVWSFGWLWEIV 

TLAEQPYQGLSNEQVLKFVMDOGVLEELEGC 

PLQLQELMSRCWQPNPRLRPSFTHILDSIQEEL 

RPSFRLLSFYYSPECRGARGSLPTTDAEPDSSP 

TPRDCSPQNGOPGH 


723 


2073 


A 


5672 


1 


216 


LAWIDNILPEKEfaCETDKKRKRKKGAHEDCD 

EEPQFPPPSVKIPMESVQSDPQNGIHCIAEXR 

SSSWSYSL 


724 


2074 


A 


5704 


4235 


940 


ARGRRSRPVWAASWGGRGRPAARItRPRGLA 

ATMGFEIJ5RFDGDVDPDLKCALCHKVLEDP 

LTTPCGHVFCAGCVLPWWQEGSa>ARCaiGR 

LSAKELNHVLPLKRULKLDIKCAYATRGCGR 

WKLQQLPEHLERCDFAPARCRHAOCQQVLL 

RRDVEAHMRDACDARPVGRCQEGCGLPLIH 

GEQRAGGHCCARALRAHNGALQABLGALHK 

ALKKEALRAGKREKSLVAQLAAAQLELQMT 

ALRYQKKFTBYSARLDSLSRCVAAPPGGKGE 

ETKSL'ELVLHRDSGSLGFNnGGRPSYDNHDG 

SSSEOIFVSKIVDSOPAAKEGOLQIHDRIIEVN 

GRDLSRAraDQAVEAFKTAKEPIWQVLRRT 

PRTKMFTPPSESQLVDTGTQTDIIPEHIMALT 

KMSSPSPPVLDPYLLPEEHPSAHEYYDPNDYI 

GDIHQEMDREELELEEVDLYRMNSQDKLGLT 

VCYRTDDEDDIGIYISEIDPNSIAAKDGRIREG 

DRIIQINGIEVQNREEAVALLTSEENKNFSLLI 

ARAELQLDEGWMDDDRhnJFLDDLiaiDMLE 

EQHHQAMQFTASVLQQKKHDEDGGTTDTAT 

ILSNQHEKDSGVGRTDESTRNDBSSEQENNG 

DDATASSNPLAGQRKLTCSQDTLGSGDLPFS 

NESFISADCTDADYLGIPVDECERFRELLELK 

CQVKSATPYGLYYPSGPLDAGKSDPESVDKE 

LEIXNEELRSIELECLSIVRAHKMQQLKEQYR 

ESWMUlNSGFRNYNTSIDVRRHELSDnELPE 

KSDKDSSSAYNTGESCRSTPLTLEISPDNSLRR 














AAEOISCPSSEGAVGTTEAYGPASKNLLSrrc 

DPBVGTPTYSPaJCELDPNQPLESKERRASDG 

SRSPTPSQKLGSAYLPSYHHSPYKHAHIPAHA 

QHYQSYMQLIQQKSAVEYAQSQMSLVSMCK 

DI^SPTPSEPRhffiWKVKIRSDGTRYrrKRPVR 

DRLLRERALBaREERSGNOTDDDAVSEMKM 

GRYWSKEERKQHLVKAKEQRRRREFMMQSR 

LDCLKEQQAADDWCEMNILELSHKKMMKKR 

NKKIFDNWMTIQELLTHGTKSPDGTRVYNSF 

Lsvnv 


725 


2075 


A 


5707 


3 


1770 


QISTEVSEAPVANDKPKTLWKVQKKAADLP 

DRDTWKGRFDELMSCVGYAIGLGNVWRFPY 

LCGKNGGGAFIJPYFLTLIFAGVPLFIXECSLG 

QYTSIGGLGVWKLAPMFKGVGLAAAVLSFW 

LNIYYIYnSWATYYLYNSFTTTLPWKQCDNP 

WNTORCFSNYSNmnTNMTSAVVEFWERN 

MHQMTDGLDKPGQIRWPLAITLAIAWILVYF 

CIWKGVGWTGKWYFSATYPYIMLIILFFRGV 

TLPGAKEGILFYrrPNFRKLSDSEVWLDAATQ 

IFFSYGLGLGSLIALGSYNSFHNNVYRDSnVC 

\^XPlL>Vrfi OiXETAVJF VlTol VOriW/VTl V 1 IvKJ^I Al IV 

AASOPGLAFLAYPEAVTQLPISPLWAILFFSM 
UJ4LGIDSQFCTVEGFn'AL\T)EYPIUXRNfyR. 

ASGMSLLFLVFFECVSISWFYGVNRFYDNIQE 
MVGSRPCIWWKLCWSFFTPnVAGVFIFSAVQ 
MTPLTMGNYVFPKWGOGVGWLMAI <!«?MVT 
IPGYMAYMFLTLKGSLKQRIQVMVQPSEDIV 
RPENOPEQPQAGSSTSKEAYI 


726 


2076 


A 


5711 


156 


423 


PRRDPGRTPELRGSAPRKTGANMPVRRGHVA 

PQhrm/SrnRKFEGQNKKFIlANARVQNCAn 

YCNDGFCHvfrGFSRPDVMQKPCTCD 


727 


2077 


A 


5716 


3 


274 


HASEYFFKLCSFQVFLSFPLATIVIDVOLWIP 
LVKSPNVHYVYVLIXVLSGLLFYIPLIHFKIRL 
AWFEKMTCYLQLLFNICLPDVSEE 


728 


2078 


A 


5737 


1899 


649 


IQASRASPYPRVKVDFALSCHEDLLAPISEPIE 

WKYHSPEBEISLGPACWLWDFLRRSQQAGFL 

LPLSGGVDSAATACLIYSMCCQVCEAVRSON 

EEVLADVRTTVNQISYTPQDPRDLCGRILTTC 

YMASKNSSQETCTRARELAQQIGSHHISLNID 

PAVKAVMGIFSLVTGKSPLFAAHGGSSRENL 

ALQNVQARIRMVLAYLFAQLSLWSRGVHGG 

LLVLGSANVDESLLGYLTKYDCSSADIhJPIGG 

ISKTOLRAFVQFaQRFQLPALQSILLAPATAE 

LHPLADGQVSQTDEEDMGMTYAELSVYGKL 

RKVAKMGPYSMFCBCLLGMWRHICrPRQVAD 

KVKRFFSKYSMNRHKMTrLTPAYHAENYSPB 

DNRFDLRPFLYNTSWPWQFRCIENQVLQLER 

AEPQSLDGVD 


729 


2079 


A 


5741 


1 


5976 


pgcaarlsrarapgpgaagagrkrladpgpp 

pasrrlrapgsrprlapctkraaqpaharma 

praaggAplsaraaaaspppfqtpprcpvpll 

lllllgaaragal^iqrrfpsptptnnfaldg 

aagtvylaavnrlyqlsganlsleaeaavg 

pvrosplchapqlpqascehprrltdnynkil 

qldpgqglvwcgsiyqgfcqlrrrgnisav 

avrfppaappaefvtvfpsmlnvaanhpnas 

tvglvlppaaoaggsrllvgatytgygssff 

prnrsledhrfehttpeiairsldtrodlaklpt 














fdlnpsddnilkikqgakeqhklgfvsaflhp 

sdpffgaqsyaylalnsearagdkesqarsl 

lariclphgaggdakkltesyiqlglqcagg 

agrgdlysrlvsvfparerlfavferpqgspa 

araapaalcafrfadvraakaartacrvep 

apdwavldswqgtgpacerianiqlqpeq 

ux:gaahlqhplsilqplkatpvfrapgltsv 

avasvnnytavflgtvngrllkinlnesmq 

wsrrwtvaygepvhhvmqfdpadsgyly 

lmtshqmarvkvaacnvhstcodcvgaad 

aycgwcaletrctlqqdcmssqqhfwtsa 

segpsrcpamtvlpseidvrqeypgmilqisgs 

lpslsgmemacdygnnirtvarvpgpafghq 

laycnllprdqfppfppnqdhvtvh^vrvn 

grnivkanftiydcsrtaqvyphractsclsa 

qwpcfwcsqqhscvsnqsrceaspnptspqd 

cprtllsplapvptxjgsqnilvplantaffqg 

aalecsfgleeifeavwvneswrcdqwlh 

ttrksqvfplslqixgrparfldspepmtvm 

vyncamgspdcsqclgredlghlcmwsdgc 

rlrgplqpmagtcpapeiraieplsgpldggt 

llmgrnlgrrlsdvahgvwigovaceplp 

drytvseeivcvtopapgplsgwtvnaske 

gksrdrfsyvlplvhsleptmgrkaggtrm 

hgndlhvgselqvlvndtdpcrelmrtdtsi 

acmpegalpapvpvcvrferrgcvhgnltf 

wymqnpvitaisprrspvsggrtitvagerfh 

mvqnvsmavhhigreptlckvlnstlrrcpsp 

galsnasapvdffingrayadevavaeelld 

peeaqrgsrfrldylpnpqfstakrekwikh 

hpgepltlvmvstkgagkeqdslglqshey 

rvkigqvscdiqivsdriihcsvneslgaavgq 

LPmQVGNFNQTlATLQLGGSETAnVSrVlCSV 

IXIXSWAU'VFCTKSRRAERYWQKTLLQME 

EMESQIREEIRKGFAELQTDMTDLTKELNRSQ 

OIPFLEYKHFVTRllWKCSSLYEERYVLPSQT 

LNSQGSSQAQETHPLLGEWKIPESCRPNMEE 

GISLFSSLLDNKHFUVFVHALEQQKDFAVRD 

RCSLASLLTIALHCaaJBYYTSIMKELLVDLID 

ASAAKNPKIAdLRRlESWEKMLTNWMSICM 

YSCLRETVGEPFFLLLCAIKQQINKGSIDArrO 

KARYTU^EEWLLRENIEAKPKNLNVSFQGCG 

KlDSl^VRAMDTim.TQVKEKILEAFCKNVPY 

SQWPRAEDVDLEWFASSTQSYILRDLDDTSV 

VHXJMIKLNTLAHYKIPEGASLAMSLIDKKD 

NTLGRVKDIX)TEKYPmVLPTDELAEPKKSH 

RQSHRKKVLPEIYLTRLLSTKOTLQKFLDDLF 

KAILSIREDKPPLAVKYFFDFLEEQAEKRGISD 

PDTLHIWKTNSLPLRFWVNILKNPQFVFDIDK 

TDHIDACLSVIAQAFIDACSISDLQLGKDSPTN 

KLLYAKEIPEYRKIVQRYYKQIQDMTPLSBQE 

MNAHLAffiSRKYQNEFNTNVAMAEIYKYAK 

K.iKl'VillwAALoANrlAKRIQLQHKjrEQVVAL 

MEDNIYECYSEA 


730 


2080 


A 


5744


3 


292 


QPSPLFHSHLEfLQLLRTAQLPEQVSWPWOO 
VANGKGNQKNMGSPQPSLLAFERNLELQIMG 
LGYSLLMGKLRPRVAKDTLRVHRDSTPSPLT 
LKD 


731 


2081 


A 


5747 


1 


382 


FLKCMRKAFRSSJaXQVGYTPDGKDDYRWC 
FRVDBVNWTTWNTNVGIINEDPGNCEOVKRT 














LSFSLRSSRVSGRHWKNFALVPLLREASARD 

RQSAQPEEVYLRQFSGSLKPEDAEVFKSPAAS 

GEK 


732 


2082


A 


5753 


198 


3 


AQAESSTVASPEATAGPLCTRIPNVPPPTPrRP 
PGKLQAQLPCPSPVRFTSARIPPASRPQTKS 


733 


2083 


A 


5754 


2 




AAGPPGLEAEGRAPESAGPGPGGDAAETPa 

PPAHSGTLMMAFRDVTVQIANQNISVSSSTAL 

SVAN(XGAQTVQAPAEPAAGKAEQGETSGR 

EAPEAPAVGREDASAEDSCAEAGASQAADG 

ATAPKTEEEEEEEETAEVGRQAEAEAGDLEQ 

LNRTSTSIKSAKSGSEASASASKDALQAMILS 

LPRYHCENPASCKSPTLSTDTLRKRLYRIGLN 

LFNINPDKGIQFLISRGFIPDTPIGVAHFLLQRK 

GLSRQMIGEFLGNSKKQFNRDVLDCWDEM 

DFSSMELDEALRKFQAfflRVQGEAQKVERUE 

AFSQRYCMCNPEVVQQFHNPDTIFILAFAIILL 

NTONIYSPNIKPDRKKIMLEDFIRNLRGVDDG 

ADIPRELWGIYERIQQKELKSNEDHVryVTK 

VEKSIVGNIKTVLSVPHRRLVCCSKLFEVTDV 

NKLQKQAAHQREVFLFNDLLVILKLCPKKKS 

OS 1 1 1 r CKi VOlJjUMQrQIJ'liNKYYSHGITLV 

TPLSGSEKKQVLHFCALGSDEMQKFVEDLKE 

S1AEVTELEQIRIEWELEKQQG7XTLSFKPCGA 

QGDPQSKQGSPTAKREAALRERPAESTVEVSI 

QQTPPLPPPPPTPPGTLVQCQQIVKVIVLX>KPC 
LARMEPLLSQALSCYTSSSSDSCGS1PLGGPG 
SPVKVTHQPPLPPPPPPYNHPHQFCPPGSLLH 
GHRYSSGSRSLV 


734 


2084 


A 


5788 


8 


362 


SSVMGDLVGQGLEEQIVARDENSWLIDGGTP 
IDDVMRVLDIDEFPQSGNYETIGGFMMFMLR 
KIPKRTDSVKFAOYKFEWDIDNTOIDQLLVT 
RIDSKATALSPKLPDAKDKEESVA 


735 


2085 


A 


5827 


1 


1257 


MVFSAVLTAFHTGTSNTIl-VV YENTYMNITL 
PPPFQHPDLSPLLRYSFETMAPTGLSSLTVNST 
AVPTTPAAIXSI^PLQITLSAIMIFILFVSFLG 
NLVVOJvIVYQKAAMRSAINlLLASLAFADM 

t T AVT 'MKifPPAT VTIT TTPWIPrtVTTPnJVQ AXjfl? 

i.(i.tA vi^iNivijrr/u^ V iiLii iJEvwiroAj*riJr\.voAJnr 
FWIJVIEGVAn-UISIDRFLirVQRQDKLNPYR 
AKVUAVSWATSFCVAFPLAVGNPDLQIPSRA 
POCVFGYTTNPGYOAYVILrSLISFFTPFr VILY 

SFMOILNTXRHNALRIHSYPEGICLSQASKLGL 

MOIXJRPFQMSroMOFKTRAFrnLIIJFAVnVC 

WAPFTTYSLVATFSKHFYYQHNFFEISTWLL 

WiryiJKSAIJ4PlJrYWRIKKinroACXDMNdP 

KSFKFIJPQLPGHTKRRIRPSAVYVCCHHRTVV 


736 


2086 


A 


5870 


3 


268 


FTRSDELARHYRTHTGEKRFSCPLCPKQPSRS 
DHLTKHARRHPTYHPDMIEYRGRRRTPRIDPP 
LTSEVESSASGSGPGPAPSFTTCL 


737 


2087 


A 


5871 


2 


521 


LTWPQLFLETLPELLHMSRPAEDGPSPGALVR 

RSSSLGYISKAPEYFLLKSRSDLMFEKQSERH 

GLARRLTTARRPPASSEQAQQELFNELKPAV 

DGANFIVNHMRDQNNYNEEKDSWNRVART 

VDRLCIJVVTPVMVVGTAWIFLQGVYNQPPP 

QPFPGDPYSYNVQDKRFI 


738 


2088 


A 


5881 


1 


1160 


LVVTAITAILAFPNEyTRMSTSELISELFNDCG" 
LLDSSKLCDYENRFNTSKGGELPDRPAGVGV 
YSAMWQI^TULKrvniFTFGMKIPSGLFIPS 
MAVGAIAGRLLOVGMEQLAYYHQEWTVFNS 














WCSQGADCrrPGLYAMVGAAACXGGVTOMT 

VSLWIMFELTGGLEYIVPLMAAAMTSKWVA 

DALGREGIYDAHIRLNGYPFLEAKEEFAHKTL 

AMDVMKPRKNDPLLTVLTQDSNlTVEDVETn 

SETTYSGFPVWSRESQRLVGFVLRRDLnSIE 

NARKKQDOVVSTSnYPTEHSPPLPPYTPPTLK 

LRNILDLSPFTVTDLTPMEIVVDIFRKLGLRQC 

LVTHNGIOIXjimCKDVLKHIAQMANQDPDSI 

LFN 


739 


2089 


A 


5892 


2 


916 


TLQLAASVPFFAISLISWWLPES/UIWLIINGKP 

DQALQELRKVARINGHKEAKNLTIEVLMSSV 

KEEVASAKEPRSVLDLFCVPVLRWRSCAMLV 

VNFSLUSYYGLVFDLQSLGRDIFLLQAUFGA 

VDFLGRATTALLLSFLGRRTIQAGSQAMAGL 

AILANMLVPQDLQTLRWFAVLGKGCFGISL 

TCLTIYKAELFPTPVRMTADGILHTVGRLGA 

MMOPLILMSRQALPLLPPLLYGVISIASSLWL 

FFLPETQGLFLPDTnQDLESQKSTAAQGNRQE 

AFTVESTSLLETVALHGAL 


740 


2090 


A 


5900 


2 


426 


RPDCTLGIGFHFSVDGVHFLTQREVQNLWKE 

NLIIU3TAKKHGYEVVDTFT1TMGRYKEFLQG 

KCGCHFHEVVKSKLSKEYNFIKMKRSKNHIM 

GRYFSNQSKLQQOTVTNFRSPYUVROPINQV 

CSBILLSRMCANKRTM 


741 


2091 


A 


5910 


3 


412 


R1V1PESTLLUCENGYILEAPLPTKQEEDDHDV 
VSYEKDMCIKCFHFSSVKSKILRLIEIEKRER 
QBELKEKIREERRNKLAAEMGEDGEKEFQEE 
EEEKEEEEEEEEPLPEIFIPSTPSPILCGFYSEPO 
KFWV 


742 


2092 


A 


5936 


1 


482 


MGCRLLCCVATCLLQAGPLDTAVSQTPKYLV 

TQMGNDKSDCCEQNLGHDTMYWYKQDSKK 

FLKIMFSYNNKELIINETVPNRFSPKSPDKAHL 

NLHINSLELGDSAVYFCASSQDTALQSHC3PV 

HKPPGSARKLQGSVCTCTQGSSLHSLMASDO 

VPVC 


743 


2093 


A 


5938 


1 


1566 


MNSFFGTPAASWCLLESDVSSAPDKEAGRER 

RALSVQQRGGPAWSGSLEWSRQSAGDRRRL 

GLSRQTAKSSWSRSRDRTCCCRRAWWILVPA 

ADRARRERFIMNHCWDTNSSENWHPIWNVN 

DTFOfflLYSDINITYVNYYLHQPQVAAIFnSYF 

LIFiaX>iMGNTWCFrVMW«HMHTVTNI^ 

LNLAISDLLVGlFCMPrnXDNIIAGWPFGNTM 

CKISGLVQGISVAASVFTLVAIAVDRFQCWY 

PFKPKLTIKTAFVnMIIWVLArnMSPSAVMLH 

VQEEKYYRVRLNSQNKTSPVYWCREDWPNQ 

EMRKIYTrVLFANIYLAPLSLIVIMYGRIOISLF 

RAAVPHTGRKNQEQWHWSRKKQKIIKMLU 

VALLFILSWLPLWTLMMLSDYADLSPNELQn 

NIYIYPFAHWLAFGNSSVNPnYGFFNENFRRG 

FQEAFQLQLCQKRAKPMEAYALKAKSHVLIN 

TSNQLVQESTTQNPHOETLLYRKSAEKPQQE 

LVMEELKETTNSSEI 


744 


2094 


A 


5966 


149 


327 


SHVCVSHYAGSSGCPAGAGAGAVALGISAVA 
LYDYQGGRLGVARGAWYMEAPDIRQGDM 


745 


2095 


A 


5970 


413 


856 


OAPHTDWAWAPTPMSGLGSORORQGTLASS 
PLSLPLLLAOVTOILATELFDQMARPAACMV 
CGALMWmULVGLGFPFIMEALSHFLYVPFL 
GVCVCGAIYTGLFLPETKGKTFQEISKELHRL 
NFPRRAQGPTWRSLEVIQSim 


746 


2096 


A 


5971 


3 


1343 


AQTARIUIGLELDTEGHRLFV/VFSGCrVYLPLS 

RCARHGACQRSCLASQDPYCGWHSSRGCVDl 

RGSGGTDVDQAGNQESMEHGDCQDGATGSQ 

SGPGDSAYGVRRDLPPASASRSVPIPLLLASV 

AAAFALQASVSQLLVSCACRRAHRRRGKDIE 

TPOLPRPLaLRSLARLHOGGPEPPPPSKDGDA 

VQTPQLYTTFLPPPEGVPPPELACXJTTESTPE 

LPVKHLRAAGDPWEWNQNRNNAKEGPGRSR 

GGHAAGGPAPRVLVRPPPPGCPGQAVEVTTL 

EELLRYLHGPQPPRKGAEPPAPLTSRALPPEP 

APALLGGPSPRPHECASPLRLDVPPEGRCASA 

PARPALSAPAPRLGVGGGRRLPFSGHRAPPAL 

LTRVPSGGPSRYSGGPGKHLLYLGRPEGYRG 

RALKRVDVEKPQLSLKPPLVGPSSRQAVPNG 

GRFNF 


747 


2097 


A 


5998 


2 


754 


DHASLPCSWNHRFDVETRHVFIGDHSGQVn 

IJKlJEQENCrLVTTFRGHTGOVTALCWDPVQ 

RVLFSGSSDHSVIMWDIGGRKGTAIELQGHN 

DRVQALSYAQHTRQUSCGGDGGIWWNMD 

VERQETPEWLDSDSCQKCDQPFFWNFKQMW 

DSKKIGLRQHHCRKCGKAVCGKCSSKRSSIPL 

MGFEFEVRVCDSCHEAITOEERAPTATFHDSK 

HNTVHVHFDATROWLLTSOroKVIKLVVDNfr 

PWS 


748 


2098 


A 


6001 


2 


747 


AMVFGGWPYVPQYRDIRRTQNADGFSTYV 

CLVLLVANILRILFWFGRRFESPLLWQSAIMIL 

TMLLMLKLCTEVRVANELNARRRSFTAADS 

KDEEVKVAPRRSFLDFDPHHFWQWSSFSDYV 

QCVLAFTOVAOYITYLSIDSALFVETLGFLAV 

LTEAMLGVPQLYRNHRHQSTEGMSKMVLM 

WTSGDAFKTAYFLLKQAPLQFSVCGLI,QVLV 

DLAILGQAYAFARHPQKPAPHAVHPTGTKAL 


749 


2099 


A 


6002 


2 


447 


GEPDRSELVRMHILEETTAEPSLQATQMKLK 

RARLADDLNHOAQRPGPMELVEKNILPVDSS 

VKEAnGVGKEDYPHTQGDFSFDEDSSDALSP 

DQPASQESQGSAASPSEPKVSESPSPVTTNTP 

AQFASVSPTVPEFLKTPPTAD 


750 


2100 


A 


6004 


2 


427 


LLTQAMLVLPHRPQWFTPGPRLQAQGPCXJEG 

WRWELRLRNYVPEDEDLNKRRVPQAKPDAV 

QEKVKEQLEAAKPEPVIEEVDLAKLAPRKPD 

WDLKRDVAKKLEKLLKRTQRAIAEURERLK 

GQEDSU3SAVDAATEHKTC 


751 


2101 


A 


6007 


33 


1280 


TJXJAKVDNQPEKLVRSAEDVSTVPTQPDNPF 

SHPDKLKRMSKSVPAFLQDESDDRETDTASE 

SSYQLSRHKKSPSSLTNLSSSSGMTSLSSVSGS 

VMSVYSGDFGNLEVKGNIQFAIEYVESLKEL 

HVFVAQCKDLAAADVKKQRSDPYVKAYLLP 

DKGKMGKKKTLWKKTLNPVYNEILRYKIEK 

QIUCrQKLNLSlWHRDTFKRNSFLGEVELDLE 

TWDWDNKQNKQLRWYPLKRKTAPVALEAE 

NRGEKOCLALQYVPEPVPGKKLPTTGEVHIWV 

KECXDUTXRGSHLNSFVKCriLPDTSRKSRQ 

KTRAVGKTTNPIFNHTMVYDGFRPEDLMEAC 

VELTVWDHYKLTNQFLGGLRIGFGTGKSYGT 

EVDWMDSTSEEVALWEKMVNSPNTWIEATL 

PLRMLLIAKISK 


752 


2102 


A 


6028 


108 


1283 


KEIFSPFEUSVKPUXLLGVTCSQSMAFEEU. 

SQVGGLGRFQMLHLVFIU»SLMLUPHILLENF 

AAAIPGHRCWVHMLDNNroSGNETOILSEDA 














LLRISIPLDSNLRPEKCRRFVHPQWQLLHLNG 

TfflSTSEADTEPCVDGWVYDQSYFPSTIVTKW 

DLVCDYQSLKSWQFLLLTGMLVGOnGGHV 

SDRFORRFILRWGLLQLAITDTCAAFAPTFPV 

YCVLRFLAGFSSNIfflSNNSLPITEWlRPNSKAL 

VmSSGAU4IGQIILGGLAYVnU)WQTLHVV 

ASVPFFVFFLLSRWLVESARWLDTNKLDEtaL 

KALRKVARTNGKNAEETLNIEWRSTMQEE 

UDAAQTXTTVWDLFRNPSMRKRICILVFLRK 

KNLKEKA 


753 


2103


A 


6043 


1 


1470 


DSFESELRLIFEIHHSGEKGDIWFLACEQDIEK 

VCETVYQGSNLNPDLGELVWPLYPKEKCSL 

FKPLDETEKRCXJVYQRRWLTTSSGEFUWSN 

SVRFVIDVGVERRKVYNraiRANSLVMQPISQ 

SQAEIRKQILGSSSSGKFFCLYTEEFASKDMTP 

LKPAEMQEANLTSMVLFMKRIDIAGLGHCDF 

MNRPAPESLMQALEDUDYLAALDNDGNLSE 

FGnMSEFPLDPOLSKSn ASCFFDCVTiPVI TTA 

AMVTAPNCFSHVPHGAEEAALTCWKTTLHPE 

GDHFn,ISrYKAYQDTTLNSSSEYCVEKWCRD 

YFLNCSALRMADVIRAaLEIIKRIELPYAEPA 

FGSKENTLNIKKALLSGYFMQIARDVDGSGN 

YLMLTHKOVAOLHPLSGYSrrKKMPFWVLF 

HKFSISENNYnUTSEISPELFMQLVPQYYFSNL 

PPSESKDILQQWDHLSPVSTMNKEQQMCET 

CPETEQRCTLQ 


754 


2104 


A 


6055


2 


394 


YYALHHWPFPDLLCQTTOAIFQMNMYOSCIF 

LMLINVDRYAAIVHPLRLRJILRRPRVARLLC 

LGVWALILVFAVPAARVHRPSRCRYRDLEVR 

LCFESFSDELWKGRLLPLVLLAEALGFLLPLA 

AWYSS 


755 


2105


A 


6059 


3 


1795 


LGLGSGTLLSVSEYKKKYREHVLQLHARVKE 

RNARSVKITKRFTKLLIAPESAAPEEALGPAEE 

PEPGRAIUIS1>IHTFNRLFRRDEEGRRPLTVVL 

QGPAGIGKTMAAKKILYDWAACaCLYQGQVD 

FAFFMPCGELLERPGTRSLADLEiDQCPDRGA 

PVPQMLAQPQMiFIIX)G/U)ELPALGGPEAAP 

CnDPFEAASGARVLGGLLSKALLPTALLLVTT 

RAAAPGRLQGRLCSPQCAEVRGFSDKDKKK 

YFYKFERDERRAERAYRFVKENEXLFALCFV 

PFVCWWCrVLRQQLELGRDLSRTSKTTTSVY 

UJTTSVLSSAPVADGPRLQGDLRNLCRLARE 

GVLGRRAQFAEKELEQLELRGSKVQTLFLSK 

KEIJPOVLETEVTYQFIDQSFQHTJVALSYLLE 

DGGVPRTAAGGVGTLLRGDAQPHSHLVLTT 

RFLFGLLSAERMRDIERHFGCMVSERVKQEA 

LRWVQGQGQGCPGVAPEVTEGAKGLEDTEE 

PffiBEEGEEPNYPLELLYCLYETQEDAFVRQA 

LCRFPELALQRVRFCRMDVAVLSYCVRCCPA 

QQALRLISCRLVAAQEKKKKSLGKRLQASLO 

GO 


756 


2106 


A 


6060 


12 


436 


SGRPTRPAKPTGQGMGRFMLTLVCQGSIMMS 
ARDLIMNNLTELQPGLFHHLRFLEELRLSGNH 
LSHIPGQAFSGLYSLKILMLHNNQLGGIPAQA 
LWELPSLQSLRUJANLISLVPERSFEGLSSLRH 
LWLDDNALTEIPS 


757 


2107 


A 


6063 


54 


419 


rrPLGLGAADMCAFPWT .1 .1 .1.1.1 .1 .QEGSQRRL 
WRWCGSEEWAVLQESISLPLEIPPDBBVENn 
WSSHKSIATVVPGKEGHPATTMVTNPHYQG 














QILTMLLRSLQQPSASWPRDCSSSCSW 


758 


2108 


A 


6066 


123 


438 


IGISCPAriFVPMFSHSLIGIGEEYQLPYYNMV 
PSDPSYEDMREWCVKRLRPIVSNKWNSDEC 
LRAVLKLMSECWAHNPASRLTAUOKKTLAK 
MVESQDVKI 


759 


2109 


A 


6072 


3 


650 


PGRRITU»AAIJEERAMEKLREKVFFQNRGKGT 

LSSnPNNSDTRKATETTSLSSKPEYVNPDFRW 

SKDPSSKSGNLLETSEVGWTSNPEELDPIRLA 

LLGKSGLSCQVGSATSHPVSCQH>IDEDQRISP 

KDKSTAGREFSGQVSHQTTSENQCTPIPSSTV 

HSSVADMQNMPAAVHALLTQPSLSAAPFAQ 

RYLGTLPSTGSTTLPQCHAGNATVW 


760 


2110 


A 


6077 


3 


730 


PLRLTLMEEVUXGLKDREGYTSFWNDCISSG 

LRGCMLIELPLRGRLQLEACGMRRKSLLTRK 

VICKSDAPTGDVLJUDEALKHVKETQPreTVQ 

NWIELLSGETWNHjCLHYQLRNVRERLAKNL 

VEKGVLTTEKONFLLFDMTTHPLTNNNIK'nR 

LIKKVQEAVLDKWVNDPHRMDRRLLALIYL 

AHASDVLH>JAFAPLLDEQYDLATKRVRQLLD 

LDPEVECLKANTNEVLWAWAAFTK 


761 


2111 


A 


6078 


833 


390 


rVSFHLSOFKKFVRPFSFLSVHGUJVDEYHSV 
HQKLSADMADHSNLIRSLLVGAEDARLMRD 
NOCTMKSRYMELYDLNRDLLNGYKIRWNNH 
TELLGNLKAVNQAIQRAGRLRVGKPKNQVrr 
ACRDAIRSNNINTLFKIMRVGTASS 


762 


2112 


A 


6079 


2 


2686 


KKArrCOEKEKQDUKSLAMLKDGFRTDRGS 

HSDLWSSSSSLESSSFPLPKQYLDVSSQTDISG 

SFGINSNNQLAEKVRLRLRYEBAKRRIANLKI 

QLAKLDSEAWPGVLDSERDRLIUNEKEELLK 

EMRFISPRKWTQGEVEQLEMARKRLEKDLQ 

AARDTQSKALTERLKLNSKRNQLVRELEEAT 

RQVATLHSQLKSLSSSMQSLSSGSSPGSLTSSR 

OSLVASSLDSSTSASFTDLYYDPFEQLDSELQ 

SKVEFLLLEGATGFRPSGCrmHEDEVAKTQ 

K/VEGGGRLQALRSLSGIPKSMTSLSPRSSLSS 

PSPPCSPLMADPLLAGDAFLNSLEFEDPELSA 

TLCELSLGNSAQERYRLEEPGTEGKQLGQAV 

NTAQGCGLKVACVS/\AVSDESVAGDSOVYE 

ASVQRLGASEAAAFDSDESBAVGATRIQIALK 

YDEKNKQFAHJIQLSNLSALLQQQDQKVNIR 

VAVLPCSESTTCLFRTRPLDASiyn.v™EVFW 

VSMSYPALHQKTUIVDVCTIDRSHLBECLGG 

AQISLAEVCRSGERSTRWYNLLSYKYLKKQS 

RELKPVGVMAPAJSGPASTDAVSALLEQTAVE 

LEKRQEGRSSTQTLEDSWRYEETSENEAVAE 

EEEEEVEEEEGEEDVFTEKASPDMDGYPALK 

VDKETNTETPAPSPTVVRPKDRRVGTPSQGPF 

LRGSnKSKTFSPGPQSQYVCRLNRSDSDSST 

LSKKPPFVRNSLHIRSVRMKRPSPPPQPSSVK 

SLRSERLIRTSLDLELDLQATRTWHSQLTQEIS 

VLKBLKBQLEQAKSHGEKELPQWLREDERER 

LLLRMLEKRMDRAEHMGELQTDKMMRAAA 

KDVHRLRGQSCKEPPEVQSFREKMAFFTHPR 

MNIPALSADDV 


763 


2113 


A 


6082 


3 


1558


PHPIRFSia.CVSFNNQEYNQFCVIBEASKANE 

VLENLTQOKMCLVPGKTRKLLFKFVAKTED 

VGlCKffirrSVDIALGNETGRCVVLNWQGGGG 

DAASSQEALQAARSFKRRHKUDNEVHWGSn 

IQASTMDSRVPNISVHLLHEPPALTNHVfYCLV 
VTVQSHEKTQIRDVKLTAGLKPGQDANLTQK 
THVTLHGTELCDESYPALLTDIPVGDLHPGEQ 
LEKMLYVRCGTVGSRMFLVYVSYLINTTVEE 

EHLERVYADIPFLLMTDLLSASPWALTIVSSE 

LHLAPSMTTVDQLESQVDNVILQTGESASECa' 

CLQCPSLGNIEGGVATGHYnSWKRTSAMENI 

PXnrvrrLPHVIVENIPLHVNADLPSFGRVRES 

LPVKVHLQNKTDLVQDVEISVEPSDAFMFSG 

LKQIRLRILPGTCQHklLYNFYPLMAGYQQLPS 

LNINLLlUTNFTNQLUiiUTPTSIFVKPQGRLM 

DDTSIAAA 


764 


2114 


A 


6093 


1 


1422 


AAADLANSNAGAAVGRKAGPRSPPSAPAPAP 

PPPAPAPPTLGNNHQESPGWRCCRPTLRERN 

ALNIFNNEIMADVHFVVGPPGATRTVPAHKY 

VLAVGSSVFYAMFYGDLAEVKSEIHIPDVEPA 

AFLOXKYMYSDEIDLEADTVLATLYAAKKYI 

VPALAKACVTvJFLETSLEAKNACVLLSQSRLF 

EEPELTQRCWEVIDAQAEMAUISEGFCEIDR. • 

VJlJLJMlV IKJlALNl KJlAVVrfiAVLNWAEAEC 

KRQGLPITPRNKRHVLGRALYLVRIPTMTLEE 

FANGAAQSDILTLEETHSIFLWYTATNKPRLD 

FPLTKRKGLAPQRCHRFQSSAYRSNQWRYRG 

RCDSIQFAVDRRVFIAGLGLYGSSSGKAEYSV 

KmMU/rVVI^QNLTKFMSDGSSNTFPVWF 

EHPVQVEQDTFYTASAVLDGSELSYFGQEGM 

TEVQCXJKVAFQFQCSSDSnTNGTGVQOGQIPE 

LIFYA 


765 


2115


A 


6099 


1 


1130 


SGFTHYAIYDnVKGSCFCNVHADQCIPVHGF 

RPVKAPGTFHMVHGKCMCKHNTAGSHCQH 

CAPLYNDRPWEAADGKTGAPNECRTCKCNG 

HADTCHFDVNVWEASGNRSGGVCDDCQHN 

1 ILU\^ I cyJvUKrur iJKlJljKJU^rSArDACKPCS 

CHPVGSAVLPANSVTFCDPSNGDCPCXPGVA 

GRRCDRCMVGYWGFGDYGCRPCDCAGSCD 

PITGDCISSHTDIDWYHEVPDFRPVHNKSEPP 

WEWEDAQGFSALLHSGKCECKEQTLGNAKA 

FCGMKYSYVI IfUfn «!AHr>IfOTHVFV>JVK'IR' 

KVLKSTKLKIFRGKRTLYPESWTDRGCTCPIL 

NPGLEYLVAGHEDERTGKUVNMKSFVQHWK 

PSU3RICVMDILKRECK. 


766 


2116 


A 


6103 


2 


384 


MTAAATATVLKEGVLEKRSGGLLQLWKRKR 

CVLTERGLQLFEAKGTGGRPKELSFARKAYE 

CVESTGRHlYFTLVTEQGGEIDFRCPLFDPnW 

NAQITLGLVKFKNQQAIQTVRARQSLGTGTL 

VS 


767 


2117 


A 


6106 


1 


542 


SGSSHASDGSGFQELRICSHDQTPUAGMCSLP 

MARYYIKYADQKALYTRDGQLLVGDPVAD 

NCCAEKICTLPNRGLDRTKVPIFLGIQGGSRC 

LACVETEEGPSLQLEDVNIEELYKQGEEATRF 

TFFQSSSGSAFRLEAAAWPGWFLCGPAEPQQ 

PVQLTKESEPSARTKFYFEQSW 


768 


2118 


A 


6109 


3 


292 


FILQAVLQLSSQEARYKAFGTCVSHIGAILAF 

YTPSVISSVMHRVARCAAPHVHILLANFYLLF 

PPMVNPnYGVKTKQIRDSLGSIPEKGCVNRE 


769 


2119 


A 


6110 


1 


711 


RHEPSCSNGVASTKSKQNHSKYPAPSSSSSSS 
SSSSSSSPSSVNYSBSNSTDSTKSQHHSSTSNQ 
ETSDSEMEMEAEHYPNGVLGSMSTRIVNGAY 
KHEDLQTDESSMDDRHPRRQLCGGNQAATE 
RIILFGRELQALSEQLGREYGKNLAHTEMLQD 
AFSLLAYSDPWSC3>VOOOI DPinnFPVPAAl 

NSAILESQNIJ'KQPH.MLALGQASECLRLMA 
RAGLGSCSFARYDDYLH 


770 


2120 


A 


6125 


2 


570 


YFGLNLHVQHLGNNVFLLQTLFGAVttJLANC 
VAPWALKYMNRRASQMLLMFLLAICLLAIIF 
VPQEMQMLREVLATLGLGASALANTLAFAH 

fTNFVTPTTTB AUAMOTMA'TTIA'MTAnAf AUi tun.! 

ILSVYSPPLPWnYGVFPFISGFAFLLLPETRNK 
FLFDTIQDEKNERKDPREPKQEDPRVEVTQF 


771 


2121 


A 


6126 


909 


333 


RSFVLDTASAICa^YNAHYKNHPKYWCRGYF 

RDYCNIIAFSPNSTNHVALRDTGNQLIVTMSC 

LTKEDTGWYWCGIQRDFAKDDMDFTEIJVT 

DDKGTLANDFWSGKDLSGNKmSCKAPKVV 

RKADRSRTSILnaUTGLGnSVISHLTKRRRS 

QKNRRVGNTLKPFSRVLTPKEMAPTEQM 


772 


2122 


A 


6148 


•7 


OI\J 


r VJLUlUAL^illiai't'MNJSJ'l'l'ASl'FNKQyQULF 

TQGSGENKraiNYEFDTKDLVCLGLSSIVGV 

WYLLRKHWIANNLFGLAFSLNGVELLHLNN 

VSTOCILLOOLFIYDVFWVFGTNVMVTVAKS 

FEAPIKLVFPQDIXBKGLEANNFAMLGLGDV 

VIPGIFIAUXRITJISLKKNTHTYFYTSFAAyiF 

GLGLTIFIMHIFKHAQPAIXYLVPAaGFPVLV 

ALAKGEVTEMFSYEESNPKDPAAVTESKEGT 

EASASKGLEKKEK 


773 


2123 


A 


6161 


3 


1088 


OJPMLVTRKNHPKLLLRRTESVAEKMLTNW 
FTFLLYKFLKESAGEPLFMLYCAIKHQMEKG 
PIDAITGEARYSLSEDKLIRHUDYKTLTLNCV 

AYKGVPYSQRPKAADMDLEWRQGRMARIIL 

QDEDVTTKIDNDWKRLNTLAHYQVrDGSSV 

ALVPKQTSAYNISNSSTFTKSLSRYESMLRTA 

SSPDSLRSRIPMrrPDLESGTKLWHLVKNHDH 

LDQREGDRGSKMVSEIYLTRIXATKGTLQKF 

VDDLFETIFSTAHRGSALPLAJKYMFi:a?LDEQ 

ADKHQIHDADVRHTWKSNCLPLRFWVNVIK 


774 


2124 


A 


6163 


860 


125 


KTAVKKRNLNPVFNETLRYSVPQAELQGRVL 
SLSVWHRESLGRNIFLGEVEVPLDTWDWGSE 
PTWLPLQPRVPPSPDDU^GLLALSLKYVPA 
GSEGAGLPPSGELHFWVKEAR0LLPLRAGSL 
DTYVOCFVLPDDSRASRORTRVVRR<II SPVF 

NHTMVYDGFGPADLRQACAELSLWDHOALA 
NRQLGOTRLSLOTGSSYGLQVPWMDSTPEEK 
QLWQALLEQPCEWVDGUPLRTNLAPRT 


775 


2125


A 


6191 


2 


392 


ARGIGSLGRDHSGSGGGTGMAGAWVRKAAD 
YVRSKDFRDYLMSTHFWGPVANWGLPIAAIT 
DMK\KSPEIISRRMTFAL*CYSLTFVRFAHYVQ 
\PWNWLMLOCHTAVDFI)QLISSMPaSHGMT 
ASASAL 


776 


2126 


A 


6217 




827 


FRGYWGVRKAl-'mASWSGGLGPGKPGMKIT 
RQKHAKKHLGFFRNNFGVREPYQILLDGTFC 
QAALRGRIQLRBQLPRYLMGETQLCTTRCVL 
KELETLGKDLYGAKLIAQKCQVRNCPHFKNA 
VSGSECLLSMVffiGNPHHYFVATQDQNLSVK 
VKKKPGVPLMRIQNTMVLDKPSPKTIAFVKA 
VESGVRI^QCMRKKVSNISKItNRV»*KTLNRG 
RRKKRKKISGPNPLSCLKKKKKAPDTQSSASE 
KKRKRKRIRNRSNPKVLSEKQNAEGE 


777 


2127 


A 


6236 


1038 


1402 


YYQISSLPSIVONGIFLWLUCIFLAKQGGSRL* 
FQPFGRPRGGGHLRSGVLGQPGQHGETP/SFF 
YNSK1SPALWGPPVIPSALGGEAGKSL*PRRQ 
RFQRGGIAPLPSRVRQRAKLFLKKK 


778 


2128 


A 


6237 


422 


913 


AShbHHHRGAFLIXLAIPGS^GQDQSLIHWSN 
AVSNAD\LLDLKVN*LDH\LEEKMPL\EVKVVP 
PQVL\SEPN*RSGGCFSAPSFEVPPWTGEVKP/ 
SPQRDGGALG\QGPLGIPSDSILALLKKQT*RA 
LLNWPLGSLRRSSCFGGQDGQDLKPRSOLGC 
NSFRYRR 


779


2129 


A 


6249 


420 


36 


ARAPSPSFSVRDVELSDPARERGEMPVAVGP 

YGQSQPSCFDRVKMGFVMGCAVGMAAGAL 

FGTFSCLSSILVSSSG/SGMRGRHJvIGGIGKTM 

MQSGGTFGTEMAIGMGIRC*PWLPTTSVPSH 

QSQPMY 


780 


2130 


A 


6263 


415 


1380 


RIMRMajRGIQMLITTVGAFAAFSLMTIAVG 

TDYWLYSRGVCRTKSTSDNETSRKNEEVMT 

HSGLWRTCCLEGAFRGVCKK3DHFPEDADYE 

QDTAEYLLRAVRASSVFPILSVnXFFGGLCV 

AASEFHRSRHNVILSAGIFFVSAQLSNnGnVYI 

S\ANAGRTPGQR\DSKKSYSYGWSF/YFSGAFS 

FnOR/nC'GVGLPWHIYIEKHQQLRAKSHSEF 

LKKSTFARLPPYRYRFRRRSSSRSTEPRSRDLS 

PISKGFHTIPSTDISMFTLSRDPSKITMGTLLNS 

DRDHAFLQFHNSTPKEFKESLHNNPANRRTT 

PV 


781 


2131 


A 


6274 


832 


318 


RnKVKDLKQTLAKTAYPRCKCLVEMDQlFH 
LQVKQKQLACLCTWQARDPDCPPSTKWiyL 
VGPGMGCMVALFQDSIAWSNKSMPSSLSAIS 
QSPCQVQAPEGPSSFHLFIXSFTTCLSWQGGD 
LEFLGDLKGCSELKNFQEUTQSALVHPKADV 
WWYCGRPLLGTU'SN 


782 


2132 


A 


6281 


1324 


393 


W1SIJ>SSLLCRKNGSSAEDDRR\GEPSAEEAEG 

EREDWG1GSA»SVGAVSKVPSARF*RTYPS\E 

DEEEVTHQKSSSSDSNSEEHRKKKTSRSRNK 

KKRKNKSSKRKHRKYSDSDSNSESDTNSDSD 

DDKXRVKAKJOCKKiaaaiKTKKKKNKKTKK 

ESSDSSCBaJSEEDLSEATWMEQPNVAOTMDL 

IGPEAPinn^QDEKI^YGHAIXPGEGAAMA 

EYVKAGKRlPRRGEIGLTSEEIGSFECSGYVM 

SGSRHRRMEAVRLRKENQIYSADEKRAIASF 

NQEBRRKRESKILASFREMVHKKTKGKDDK 


783 


2133 


A 


6305


201 


1032 


WDDYPQGALRRREAAEGLHFLOPPGRVRGQ 

LRGITCPAWYCHSPSHSLLSAFCHLPTPSRCP 

AMARPPVPGSVWPNWHES/RRGQGVPGLHS 

AQEPPAGVWAA*AASAAAA\LS]DTASYKIFV 

SGKSGVGKTALVAKLAGLEVPWHHETTGIQ 

TTWFWPAKLQASSRWMFRFEFWDCGESA 

LKKFDHMLLAGMENTOAFLFUWTDRASFE 

DLPGQLARIAGEAPGWRMVIGSKEDQYMHT 

DVPERDLTAFRQAWELPLLRVKSVPGRRLG 


784


2134


A 


6308 


86 


96 


GSSPDPASLITMKNQDKKNGAAICQSNPKSSP 

OQPEAOPEOAQERPSQAAPAVEAEOPGSSQA 

PRKreOAQARTAQSGALRDVSEELSRQLEDIL 

STYCVDNNQGGPGEDGAQGEPAEPEDAEKSR 

TYVARNGEPEPTPWNGEKEPSKGDPNTEEIR 

QSDEVGDRDHRRPQEKKKAKGLGKEriLLM 

QTLNTLSTPBEKLAALCKKYAELLEEHRNSQ 

KQMHXQKKQSQLVQEKDHLROEHSKAVLA 
RSKLESLCRELQRHNRSLXEEGVQRAREEEE 

KRKEVTSHFQVTLNDIQLQMEQHNERNSKLR 

QENMELAERLKKLIEQYELREEHIDKVFKHK 

DLQQQLVDAKLQQAQEMLKEAEERHQREKD 

FLLKEAVESQRMCELMKQQETHLKQQLALY 

TEKFEEFQNTLSKSSEVFTTFKQEMEKMTKKI 

KKI£KETTMYRSRWESSNKALLEN4AEEKTV 

RDKELEGLQVKIQRLEKLCRALQT/GAQ*PVR 

GQRWGSHRTSAVRIFS 


785 


2135 


A 


6319 


1493 


889 


SPQGPLLRSVSPVSAGASSVTPGGAQPGVTTT 

PPSLVAVAPAPGSAAGPAAGWQ»HAGCRyWT 

KLPWSWGMRPMKIFFSEEYRSISTRISHDAL» 

EKCTQPAKPLSMIR\TGSSVSPG/PLVKWNWT 

RREFRNSGTRWSSCCGMSCMYSFLGHCSV/S 

QDLPLVHVDVGWQPPLGPTVGLRPOLLPLHD 

TTPCQKLWDDLDWA 


786 


2136 


A 


6320 


331 


135 


RWLPVAECDSSCVGCTGEGPGNCKEaSGYA 

REHGQCADVDECSLAEKTCVRKNH^CYNTP 

GSYVCVCPDGFEET/RRCLCAAGRG*SHRRRK 

PDTAALPRRPVMCRTYPLNySEGCPVENVAL 

RMPSPAVDSGGERLPAL 


787 


2137 


A 


6330 


1693 


227 


DYVLTAELHRQRSPGVSFGLSVFNLMNAIMG 

SGILGLAYVMANTGVFGFSFLIXTVALLASYS 

VHLLLSMCIQTAYLGP*TNYFMVLPAH*LTCL 

PLIEFLQSL*NSL\*AVTSYEDLGLFAFGLPGKL 

WAGTIUQNIGAMSSYLLnKTELPAAIAEFLT 

ODYSRYWYLDOQTLLIDCVOIVFPLALLPKIO 

FLGYTSSLSFFFMKCFFALWriKKWSIPCPLTL 

NYVEKGFQISNVTDDCKPKLFHFSKESAYALP 

TMAFSFLCHTSILPIYCELQSPSKKRMQNVTN 

TAIALSFLIYFISAUFGYLTFYDyOTTKAQRGE 

VTCHRIKDKVESELLKG**»IP*SHDVVVKmV 

KIX:iLFAVIX\TmiHFPARKAVTMMFFSNFP 

FSWIRHFLITLALNIIIVU.AIYVPDIRNVFGVV 

GASTSTCXIFIFPGLFYLKLSREDFLSWKKLGV 

GCFCVLLSFKTSILRNSLSVYIILPASRKSIYFKI 


788 


2138 


A 


6351 


1 


6622 


PRSLCFSLWAEAAVLADGGLRRRRRLLRGTM 

SASFVPNGASLEDCHCNLFCLADLTGIKWKK 

YVWQGPTSAPILFPVTEEDPILSSFSRajKADV 

LGAnVRRDQRPERRE\L*IFWGGEDPWLLTLF 

TMTYQKKKMECGRMDFPMNAVLCFSKAVH 

NLLBRCLMNRNFVRIGKWFVKPYBKDEKPIN 

KSEHLSCSFTFFLHGDSNVCTSVEINQHQPVY 

LLSEEHITLAQQSNSPFQVILCPFGLNGTLTGQ 

AFKMSDSATKKUGEWKQFYFISCCLKEMSE 

EKQEDMDWEDDSLAAVEVLVAGVRMIYPAC 

FVLVPQSDIPTPSPVGSTHCSSSCLGVHQVPAS 

TRDPAMSSVTLTPPTSPEEVQTVDPQSVQKW 

VKFSSVSDGFNSDSTSHHGGKIPRKLANHW 

DRVWQECNMNRAQNKRKYSASSGGLCEEAT 

AAKVASWDFVEATQR'INCSCLRHKNLKSRN 

AGQQGQAPSLGQQQQIU»KHKTNEKQEKSEK 

PQKRPLTPFHHRVSVSDDVGMD\ADS\ASQRL 

V\ISAP\DSQ\VRFSNIR\TNDVAK\TPQMHGTE 

MANSPQPPPLSP\HPCDVVDEGVTKTPSTPQS 

QHFYQMPTPDPLVPSKPMEDRIDSLSQSFPPQ 

YQEAVEPTVYVGTAVNLEEDEANIAWKYYK 

FPKKKDVEFLPPQLPSDKFKDDPVGPFGQESV 

TSVTELMVQCKKPLKVSDELVQQYQIKNQCL 

SAIASDAEQEPKIDPYAFVEGDEEFLFPDKKD 
RQNSEREAGKKHKVEDGTSSVTVLSHEEDA 

MSLFSPSKQDAPRPTSHARPPSTSLIYDSDLA 

VSYTDLDNLFNSDEDELTPGSKRSANGSDDK 

ASCXESKTGNLDPLSaSTADLHKMYPTPPSL 

EQHIMGFSPMNMIWKEYGSMDTTPGGTVLE 

GNSSSIGAQFKIEVDEGFCSPKPSEIKDFSYVY 

KPENCQILVGCSMFAPLKTLPSQYLPLIKLPEE 

CaYRQSWTVGKLELLSSGPSMPFKEGDGSNM 

DQEYGTAYTPQTHTSCGMPPSSAPPSNSGAGI 

LPSPSTPRFPTPRTPRIPRTPRGAGGPASAQOS 

VKYENSDLYSPASTPSTCRPLNSVEPATVPSIP 

EAHSLYVNULSESVMNLFKDCNSDSCXaCVC 

NMNKGADVGVYIPDPTQEAQYRCTCGFSAV 

MNRKFGNNSGLFFEDELDnGRNTDCGKEAE 

KRFEALRATSAEHVNGGLKESEKLSDDLILLL 

QDQCTNLFSPFGAADQDPFPKSGVISNWVRV 

EERDCX3TOCYLALEHGRQFMDNMSGGKVDE 

ALVKSSCLHPWSKRNDVSMQCSQDILRMLLS 

LQPVLQDAIQKKRTVRPWGVQGPLTWQQFH 

KMAGRGSYGTDESPEPLPIPTFLLGYDYDYLV 

LSPFALPYWERLMLEPYGSQRDIAYWLCa'E 

NEALLNOAKSFFRDLTAIYESCRLGQHRPVSR 

LLTDOIMRVOSTASKKLSEKLVAEWFSQAAD 

GNNEAFSKLKLYAQVCRYDLGPYLASLPLDS 

SLLSQPNLVAPTSQSLITPPQMTNTGNANTPS 

ATLASAASSTMTVTSGVAISTSVATANSTLTT 

ASTSSSSSSNLNSGVSSNKLPSFPPFGSMNSNA 

AOSMSTQANTVQSOQLOQQQTSALQTAGISG 

ESSSLPTQPHPDVSESTMDRDKVGIPTDGDSH 

AVrYPPAIYVYIIDPFTYENTDESTNSSSVWTL 

GUJlCW-EMVQTLPPHIlCSTVSVQnPCQYLLQ 

PVKHEDREIYPQHLKSLAFSAFTQCRRPLPTS 

TNVKTLTGFGPGLAMETALRSPDRPECIRLYA 

PPFDLAPVKDKQTELGETFGEAGQKYNVLFV 

OY(XSH]X}RWILASCTDLYGELLETCIINIDVP 

NRARRKKSSARKFGLQKLWEWCLGLVQMSS 

LPWRWIGRLGRIGHGELKDWSCLLSRRNLQ 

SLSKRLKDMCRMCGISAADSPSILSACLVAM 

EPQGSFVIMPDSVSTGSVFGRSTTLNMQTSQL 

NIPQDTSCTfflLVFPTSASVQVASATYTTENL 

DLAFNPNNDGADGMGIFDLLDTGDDLDPDn 

NILPASPTQSPVHSPGSHYPHGGDAGKGQSTD 

RLLSTEPHEEVPNILQQPLALGYFVSTAKAGP 

LPDWFWSACPQAQYQCPLFLKASLULHVPSV 

QSDELLHSKHSHPLDSNQTSDVLRFVLEQYN 

ALSWLTCDPATQDRRSCLPIHFWLNQLYNFl 

MNML 


789 


2139 


A 


6359 


1 


2002 


TGTI-TEUGIDVMGWPLKGQAFLPLVPEPRR 

LPVGPLLRALATCHALSRLQDTPVGDPMDLK 

MVBSTGWVLEEEPAADSAFGTQVLAVMRPP 

LWEPQLQAMEEPPVPVSVLHRFPFSSALQRM 

SVWAWPOATQPEAYVKGSPELVAGLCNPET 

VFTDFAQMLQSYTAAGYRWALASKPLPSVP 

SLEAAQQLTRDTVEGDLSLLGLLVMRNLLKP 

QTTPVIQAUUITRIRAVMVTGDNLQTAVTVA 

RGCGMVAPQBHLnVHATHPERGQPASLEFLP 

MESPTAVNGVKDPDQAASYTVEPDPRSRHLA 

LSOPTFOnVKHFPKLLPKVLVQGTVFARMAP 

EQKTELVCELQKLQYCVGMCGDGANDCGAL 

KAADVGISLSQAEASWSPFTSSMASIECVPM 
PCT/US01/03800



VlKJbUKUaLIJ 1 oraVrKYMAL I sLlyFlsVLlL 

YTINTNLGDLQFLAIDLVrrTTVAVLMSRTGP 

ALVLGRVRPPGALLSVPVLSSLLLQMVLVTG 

VQLOOYFLTLAQPWFVPLNRTVAAPDNLPNY 

ENTWFSLSSFQYLILAAAVSKGAPFR\RPLTN 

NVPFLLASAL»SSVLWLVLSPGLLHGPLALR 

NITDTGFKLLLVGLVTLNFVGGLHAGERARP 

VPPRLPAPPPAQAG\SKKRFKQLERELAEQPW 

PPLPAGPLR 


790 


2140 


A 


6380 


76 


1059 


SSAGSARKLQVMALAAKLWRliPFRRGAAP 

GSRLPAGTSGSRGHCGPCRFRGFEVMGNPGT 

FKRGLLXSALSYLGFETYQVISQAAWHATA 

KVEEHJEQADYLYESGETEKLYQLLTQYKESE 

DAELLWRLARASRDVAQLSRTSEEBKKLLVY 

EALEYAKRA/L/EKNESSFASHKWYAICLSDV 

GDYBGIKAKIANAYIIKHIFEKA1HJ«IPKDATS 

IHLMGIWCYTFAEMPWYQRRIA*NACLQLPP 

*FPPYEKALG\YFHRAEQVDPNFYSKNLLLLG 

KTYLKUINKiaAAFWLMKAKDYPAHTEED 

KQIQTEAAQLLTSFSEKN 


791 


2141 


A 


6434 


3 


14«0 


IAIXrVT)GLAWDDQGGLALLHISPSKLIL*QDS 

SGMS/YVMVRCnTRAFFKSLLCHICQYSIGPQ 

•VT\CPGQDACKE*KSTAN»GG*RE*»PQVLFF 

AFLSNPAVKFGRMSKKQRDSLYAEVQKHQQ 

RLQEQRQQQSGEAEALARVYSSSBNGLSNLN 

NETSOTYANGSVIDLPKSEGYYNVVSGQPSP 

DQSGLDMTVGKQlKQEPIYDLTSVPNLrmSS 

FNN\GQLAPGrr\MrEIDRIAQNIIKSHLETCQY 

TMEELHQLAWQTHTYEEIKAYQSKSREALW 

QQCAIQrrHAIQYWEFAKRITGFMELCQNDQ 

ILIXKSGOJEVVLVRMaiAFNPUWTVLFEG 

KYGGMQMFKALGSDDLVNEAFDFAKNLCSL 

QLTEEEIALFSSAVUSPDRAWLIEPRKVQKLQ 

EKIYFALQHVIQKNHLDDBTLAKLIAKIPTITA 

VCNLHGEKLQVFKQSHPEIVNTLFPPLYKELF 

NPDCATACK 


792 


2142 


A 


6440 


92 


781 


SRGI'FRCFCaRDFFPCFSNMRLFLWNAVLTLFV 

TSLIOALIPEPEVKIEVLQKPFICHRKTKGODL 

MLVHYEGYLEKDGSLFHSTHKHNNGQPIWFT 

LGILEALKGWGPGA*K/DMCVGEKRKLIIPPA 

LGYGKEGKGKIPPESIXIFNIDLLEIRNGPRSH 

ESFQEN4DL1TODWKLSKDEVKAYLKKEFEKH 

GAWNESHHDALVEDIFDKEDEDKDGnSAR 

EFTYKHDEL 


793 


2143 


A 


6446 


3201 


152 


PRUCRLWrEEDGOARPEALGKIAPRTPAELG 

ARADQELVTALMCDLRRPAAGGMMDLAYV 

CEWEKWSKSTHCPSVPLACAWSC3««JLIAFTM 

DUlSDDQDLTlUvflHILirrEHPWDLHSIPSEHH 

EArrc\LEWDQSOFPGPLFSRWPTX)QIKNCWS 

MGVSTLAVNSWE\SSVGSL\VEGGPHLWALS\ 

WLHVNQVKLALHVEKSGASSFGEKFSRWKFS 

P\SLTLF\G04AMFGWIAVTVSGL'VTV«!I I 0\P 

SGQVLVrSTVESLCRLRARVALADIAFTGGGNI 

WATADGSSAVSPVQFYKVCVSWSEKCRIDT 

DIU>SIJFMR<mT>LbntKDKFPAnHLKFLARD 

MSEQVLLCASSQTSSrVBCWSLRKEGLPVNNI 

FQQISPWGDKQPTILKWRILSATNDU3RVSA 

V\ALPKLPISLTmi>LKVASDTQFYPGLGLAL 

AFHDGSVmVHRLSLQTMAVFYSSAAPRPVD 

EPAMKRPRTAGPAVHLKAMQLSWTSLALVG 
IDSHGKLSWLRLSPSMGHPLEVGLALRHLLFL 

LEYCMVTGYDWWDILLHVQPSMVQSLVEKL 

HEEYTOQTAALQQVLSTRILAIVIKASLCKLSP 

CIVrnvajYimaFLrAlSSTLKSLLRPHFLNT 

PDKSPGD1U.TEICTK]TDVD1DKVMINLKTEEF 

VLDMNTLQALQQLLQWVGDFSO-YLLASLPN 

QPCPTSEPCPTSEPSPTSEPSPTSEPSSP»SLC\G 

SUJU>GHSFLRDGTSLGMLRELK1VVIRIWGLL 

KPSCLPVYTATSDTQDSMSLLFRLLTKLWICC 

RDEGPASEPDEALVDECCLLPSQLUPSLDWL 

PASDQLVSRLQPKQPLRLQFGRAFTLPGSAAT 

LQLDGLARAPGQPKIDHLRIUJHLGAC3PTEEC 

KACTRCGCVTMLKSPNRTTAVKQWEQRWIK 

NC/LVRWALVAGAPQU»LSPAAPQLLLSYPSA 

APEPGCOCSHRSPWTLLGAVNLSPPCRAVEG 

RGPDACVTSRASEEAPAFVQLGPQSTHHSPRT 

PRSLDHLHPEDRP 


794 


2144 


A 


6490 


418 


585 


NGDKADLENESCRAQVLMPWPAL'WEAEGG 
GSIEPRDLRLQ*AVITPL\TPAWVTQ 


795 


2145 


A 


6499 


395 


1027 


KLLWLPPHSEQKRSPLYHPQGPSGTTPSAP\FS 

SHSPPPSLLQA\PSlAAFLRTHGfflSASOPLRMP 

FPH/H*NAFLLVFPGQRSQLTS/PSHYLCREVFP 

DHHHHLCRLSl^SPLFHHRVLFCVPKQNVN 

STRAQIFCLFVHIVGCRCINTFFLHLFRLHLWL 

HFLQ1PLCKKNKSVKLGKTWGRGCQSAAGS 

DTRVRAAVGAPGLPVEPLV 


796 


2146 


A 


6503 


68 


936 


HSALLTHSSFCVFTLCQDFFTYSSMSEEVTYA 
DLOFONSSEMEKIPElOJCFfiFK' APPA PCHVWP 

PAALFLTLLOXLUGLGVLASMFHVTLKIEM 

KKMNKLQNISEELQRNISLQLMSNKl>aSNKlR 

NLSTTLQTIATKLCRELYSKEQEHKCKPCPKR 

WIWHKDSCYFLSDDVQTWQESKMACAAQN 

ASIXKINNKNALEFIKSQSRSYDYWLOLSPEE 

DS/YSWYESG*YNQ\PSAWVIRNAPDLNNMY 

CGYINRLYVQyYHCTYKQRMICEKMANPVQ 

LGSTYFREA 


797 


2147 


A 


6507 


1 


881 


PGSTHASARSQVPRSAGEAAPHSRRPPGLLPH
APRAASAQLEERMRDPHPGHFTLQEGDCRGS
QTVSLTMOTADSDEMAPEAPQHTHIDVHIHQ
ESALAKLLLTCCSALRPRATQARGSSRLLVAS

WVMQrsnxm^VLGGFFmDYTLLVTSGA 

AIWTGAVAVLAGAAAFIYEKRGGTYWALLR 

TLLALAAFSTAIAALKLWNEDFRYGYSYYNS 

ACRISSSSDWNTPAPTQSPHEVRRLHLCTSFM 

DMUCALFRTLQAMLLOVWILLLLASLTPLWL 

/SL/RGECSQPKG*VPKKRDQKEMLEVSGI»PG 

STHASARSQVPRSAGEAAPHSRRPPGLLPHAP 

RAASAQLEERNCRDPHPGMTLQEGDCRGSQT 

VSLTMGTADSDHvlAPEAPQHTHIDVHIHQES 

ALAKLLLTCCSALRPRATQARGSSRLLVASW 

VMQIVLGILSAVLGGFFYIRDYTLLVTSGAAI 

WTGAVAVLAGAAAFiyBKRGGTYWALLRTL 

LALAAFSTAIAALKLWNEDFRYGYSYYNSAC 

RISSSSDWNTPAPTQSPEEVRRLHLCTSFMDM 

LKALFRlLQAMLLGVWILLLLASLmWLYC 

WRMFPTKGVSP 


798 


2148 


A 


6528


912 


2287 


VPNYLPSVSSAIGGEVPQRYVWRFaGLHSAP 
RFLVAFAYWNHYLSCTSPCSCYRPLCKLNFG 
LNVVENLALLVLTyVSSSEDF/TWVPG*GRSG 
EVFPEGTGLPLPHSDLPTSWCX}HSLQCGSQSS 

FPPAIHENAFIVFIASSLGHMLLTCILWRLTKK 

HTVSQBMDGLSLAGAPRQPRRKSRTSVLRIRV 

MVRWELSSNGNPGRGVLGLGLGLGNKLRVV 

GQNLGL*HCVWWWETGE*KRWRLQMGIE» 

GVASRRQ*VRNSVRGLVCHNSSAPPMyMGFF 

SPTVFGGGVGG*LHVTFILHPPBVEAAGIPLX.L 

GPSLPQRQGREHIVVILAAPACAPFHDR*WEP 

REIRPSP*ELGLRGEPTLSYPASCRVIRQPIP*D 

RKSYSWKQRLFIINFISFFSALAVYFRHNMYC 

EAGVYTIFAILEYTVVLTNMAFHMTAWWDF 

GNKELLITSQPEEKRF 


799 


2149 


A 


6529 


1 


874 


m'FQRlNHEHSGSVSLLALACDLGWCSEDWS 

CCX,VQGGGDLVDWQTNHGEDEAGODTDSV 

DEARCKESQQEAQENLREDLCLESFAKDKIL 

QIIEGSEREHEETRTKQAALDGEPLGGGQLTA 

VHLHPSKEQQGQEGGERQROARTHHWRGW 

EKGRRVRLRPPSGKLRADQPVRKLGOPTPS/T 

HJPGLQPHAPTPHTA/PATFTYSPAPimWFV 

RWKCPLPVEPRTRQLCRERTRKACPPKPRPPL 

CH.PODPTOPVTHHAPPVSPTOASGQERJRAEP 

GAVSYAHASATK 


800 


2150 


A 


6544 


2 


662 


SAQRWAAVAGRWGCRLLALLLLVPGPGGAS 

ElTFELPDNAKQCFYEDIAQGTKCTLEFQVrrG 

GHYDVDCRLEDPDGKVLYKBMKKQYDSFTF 

TASKNGTYKFCFSNEVFSTFTHKTVYFDFQVO 

E\THLCFL\T?/DRVSALTQMESACVSIHEALKS 

VIDYQTHFRLREAQGRSRAEDLNTRVAYWSV 

GEALILLWSIGQVFLLKSFFSDKRTTTTRVGS 


801 


2151


A 


6556 


1 


1319 


TPC34ECIKGEGLREPQNLSGSQRBPQTEGSM 

DGWRRMPRWGIXLIXWGSCrFGLPTDTTTF 

KRIFLKRMPSIRESLKERGVDMARLOPEWSQP 

MKRLTLG>nTSSVILTNYMDTQYYGEIGlGTP 

PQTFKWFDTGSSNVWVPSSKCSRLYTACVY 

HKLFDASDSSSYKHNG'reLTLRYSTGTVSGFL 

SQDnrVGGTTVTQMFGEVTEMPALPFNlLAEF 

DGWGMGFIEQAIGRVTPIFDNIISQGVLKED 

VFSFYYNRDSEKSQSLOOQIVLGGSDPQHYE 

GNFHYINLIKTGVWQIQMKGVSVGSSTLLCE 

DGCLALVDTGASYISGSTSSIEKLMEALGAKE 

KRLFDYWKCNEGPTLPPTFLFLLGGKDTPLT 

SADYLFQESYSSKKLSTLAIHAMYIPPPTGPTL 

\ALGATFMRKFYTEFDRGNNPHGFALAR 


802 


2152 


A 


6567 


13 


6147 


MCLGRMGASSPRSPEPVGPPAPGLPFCCGGSL 

lAVVVlXALPVAWGQCNAPEWUJPFARPTNI, 

TDEFEFPIGTYLNYECRPGYSGRPFSnCLKNS 

VWTGAKDRCRRKSCRNPPDPVNGMVHVKG 

IQFGSQlKYSCrKGYRLIGSSSATCJISGDTVlW 

DNETPICDRIPCGLPPnTNGDFISTNRENFHY 

GSWTYRCNPGSGGRKVFELVGEPSIYCTSND 

DQVGIWSGPAPQCUPNKCTPPNVENGILVSD 

LNKWEPELPSCSRVCXJPPPDVLHAERTQRDK 

DNFSPGQEVFYSCEPGYDLRGAASMRCTPQO 

DWSPAAPTCEVKSCDDFMGQLLNGRVLFPV 

NLQLGAKVDFVCDEGFQLKGSSASYCVLAG 

MESLWNSSVPVCEQIFCPSPPVIPNGRHTGKP 

LBVFPFGKAVNYTCDPHPDRGTSFDUGESITR 

CTSDPQGNGVWSSPAPRCGILGHCQAPDHFL 

FAKLKTQTNASDFPIGTSLKYECKPEYYGRPF
SITCLDNLVWSSPKDVCKRKSCKTPPDPVNG 

MVHVTTDIQVGSRINYSCTTGHRLIGHSSAECI

LSGNAAHWSTKPPICQRIPCGLPPnANGDHS 

TNRENFHYQSWTYRCNPOSOORKVFELVGE 

PSIYCTSNDDQVGIWSGPAPQCnPNKCTPPNV 

ENGE.VSDNRSLFSUJEWEFRCQPGFVMKCH' 

RRVKCQALNKWEPELPSCSRVCQPPPDVLHA 

ERTQRDKDNFSPGQEVFYSCEPGYDLRGAAS 

MRCTPQGDWSPAAPTCEVKSCDDFMGQLLN 

QRVLFPVNLQLGAKVDFVCDEGFQLKGSSAS 

YCVLAGMESLWNSSVPVCEQIFCPSPPVIPNG 

RHTGKPLEVFPFGKAVNYTCDPHPDRGTSFD 

UGESTTRCTSDPQGNGVWSSPAPRCGILGHC 

QAPDHFLFAKLKTCJTNASDFPIGTSLKYECRP 

EYYGHPFSrrCLDNLVWSSPKDVCKRKSCKTP 

PDPVNGMVHVITDIQVGSRINYSCTTGHRLIG 

HSSAECILSGNTAHWSTlCPPICQRIPCX3LPPn 

ANGDFISTNRENFHYGSWTYRCNLGSRGRK 

VFELVGEPSIYCTSNDDQVGIWSGPAPQaiPN 

KCITPNVENGILVSDNRSLFSLNEWEFRCQP 

GFVMKGPRRVKCQALNKWEPELPSCSRVCQ 

PPPEILHOEHTPSHQDNFSPGQEVFYSCEPGY 

DLRGAASLHCTPQGDWSPEAPRCAVKSCDDF 

LGQLPHGRVLFPU«.QLGAKVSFVCDEGFRL 

KGSSVSHCVLVGMRSLWNNSVPVCEHIFCPN 

PPAILNGRHTGTPSGDIPYGKEISYTCDPHPDR 

GMTFNLIGESmCrSDPHGNGVWSSPAPRCE 

LSVRAOHCKTPEQFPFASPnPINDFEFPVGTS 

LNYECRPOYFGKMFSISCLENLVWSSVEDNC 

RRKSCQPPPEPFNGMVmNTDTQFGSTVNYSC 

NEGFRUGSPSTTCLVSGNNVTWDKKAPICEn 

SCEPPPTISNGDFYSNNRTSFHNGTWTYQCH 

TGPDGEQLFELVGERSIYCrSKDDQVGVWSS 

PPPRaSTNKCTAreVENAIRVPGNRSFFSLTEI 

IRFRCQPQFVMVGSHTVQCQTNORWOPKLPH 

CSRVCQPPPEDLHGEHTLSHQDNFSPGQEVFY 

SCEPSYDLRGAASLHCTPQGDWSPEAPRCTV 

KSCDDFLGQLPHCaiVLLPLNLQLGAKVSFVC 

DEGFRLKGRSASHCVLAGMKALWNSSVPVC 

EQIFCPNPPAILNGRHTGTPLGDIPYGKEVSYT 

CXIPHPDRGMTFNLIGESTIKRTSEPHGNGVWS 

SPAPRCaj»VGAACPHPPKIQNGHYIGGHVSL 

YIJPGK4TISYTCDPGYLLVGKGFIFCnX}GIWS 

QLDHYCKEVNCSFPLFMNGISKELEMKKVYH 

YOJYVTUCCEDGYTLEGSPWSQCQADDRWD 

PPLAKCTSRTHDALIvaTLSGTIFFniIIFLSWI 

ILKHRKGNNAHENPKEVAIHLHSQGGSSVHP 

RTLQTNEENSRVLP 


803 


2153 


A 


6574 


2 


3233 


HGRSARLAAVPAEAMPGPRRPAGSRLRLLLL

LLLPPLLLLLRG\SHAGNLTVAVVLPLANTSY

PWSWA\RVGPAVELALAQVKARPDLLPGWT

VRTVLOSSENALGVCSDTAAPLAAVDLKWE

HNPAVFLGPGCVYAAAPVGRFTAHWRVPLL

TAGAPALGFGVKDEYALTTRAGPSYAKLGDF

VAALHRRLGWERQALMLYAYRPGDEEHCFF

LVEGLFMRVRDRLNRRVDHLEFAEDDLSHYT

RLLRTMPRKORVIYICSSPDAFRTLMLLALEA

GLCGEDYVFFHLDBFGQSLQGGQGPAPRRPW

ERGDGQDVSARQAFQAAKIITYKDPDNPEYL

EFLKQLKHLAYEQFNFTMEDGLVNTIPASFH
IXjUXYlQAVrin'LAHGGTVTDGENITQRMW 

NRSFQGVTGYLKIDSSGDRETDFSLWDMDPE 

NGAFRWLNYNGTSQELVAVSGRKLNWPLG 

YPPPDIPKCGFDNEDPACNQDHLSTLEVLALV 

OSLSIJLGILrVSFFIYRKM(H.EKELASELWRVR 

WEDVEPSSLERHLRSAGSRLTLSGRGSNYGSL 

LTTEGQFQVFAXTAYYKC»1LVAVKRVNRKR 

lELTRKVIJ'ELKHMRDVQNEHLTRFVGACTD 

ITNlCILTEVCPRGSLQDILENESITLDWMFRy 

SLTNDIVKGMLFLHNQAICSHGNLKSSNCW 

DGRPVLKrrDYOLESFRDLDPEQGHTVYAKK 

LWTAPELLRMASPPVRGSQAGDVYSFGIILQE 

lALRSGVFHVEGLDLSPKEDERVTRGEQPPFR 

PSLALQSHLEELGLLMQRCWAEDPQERPPFQ 

QIRLTLRKFNRENSSNILDNLLSRMEQYANNL 

EELVffiRTQAYLEEKRKAEALLYQILPHSVAE 

QLKRGETVQAEAFDSVmfFSDIVGFTALSAE 

STPMQWnXNDLYTCTOAVIDNFDVYKVET 

IGDAYMVVSGLPVRNGRUIACEVARMALAL 

LDAVRSFRIRHRPQEQLRLRIGIHrGPVCAGV 

VGIJCMPRyCLFGDTVNTASRMESNGEAL\KI 

HLSSVBTKAVL^EEFGGFELELRGDVEMKGKG 

KVRTYWLLGEROSSTRG 


804 


2154


A 


6583


2 


3837 


DAPGRPPVRLPTMELEDGWYQEEPGGSGA V 

MSERVSGLAGSIYREFERLIVRYDEEWKELIP 

LWAVLENLDSVFAQDQEHQVELELLRDDNE 

QLrrQYEREKALRKHAEHOTEFEDSQEQEKK 

DLQTRVESLESQTRQLELKAKNYADQISILEE 

REAELKKEYNALHQRHTEMIHNYMEHLERT 

KLHQLSGSDQUESTAHSRIRKERPISLGIFPLP 

AGDGLLTPDAQKGOCTPGSEQWKFQEI^QPR 

SHTSLJDELSDVSQGGSKATTPASTANSDVA 

Tn»TUn»IJCEENEGFVlCVTDAFNKSEISKHIBV 

QVAQETRNVSTGSAENEEKSEVQAIIESTPEL 

DMDKDLSOYKOSSTPTKGIENKAFDRNTESL 

FEELSSAGSGLIGDVDEGADLLGMGREVENLI 

LENTQIXETKNAIJCVKNDLIAKVDELTCEK 

DVLQGELEAVKQAKLKLEEKNRELEEELRKA 

RAEAEDARQKAKDDDDSDIPTAQRKRFTRVE 

MARVLMERNQYKERLMELQEAVRWTHvIIR 

ASRENPAMQEKKRSSIWQFFSRLFSSSSNTTK 

KPEPPVNLKYNAPTSHVTPSVKKRSSTLSQLP 

GDKSKAFDFLSEETEASLASRREQKREQYRQ 

VKAHVQKEDGRVQAFGWSLPQKYKQVTNG 

QGENKMKNLPVPVYLRPLDEKDTSMKLWCA 

VGVNLSGGKTRDGOSWOASVFYKDVAOLD 

"lEGSKQRSASQSSLDKlDQELKEQQKELKNQ 

EELSSLVWICTSTHSATKVLIIDAVQPGNIIJJS 

FTVCNSHVLOASVPGARETDYPAGEDLSESG 

QVDKASLCGSMTSNSSAETDSLLGGITWGC 

SAEGVTGAATSPSTNGASPVMDKPPEMEAEN 

SEVDENVPTAEE\ATEATEONAGSAEDTV\DIS 

QTGVYTEHVFTDH^WQIPEDLSPVyQSSND 

SDAYKDQISVLPNEQDLVREEAQKMSSLLPT 

MWLGAQNGCLYVHSSVAQWRKCLHSnCLKD 

Sn^IVHVKGIVLVALADOTLAIFHRGVDGQW 

DLSNYHUJDLGRPHHSlRCMrVVHDKVWCO 

YRNKIYWQPKAMKIEKSFDAHPRKESQVRQ 

LAWVGDGVWVSIRLDSTLRLYHAHTYQHLQ 

DVDIEPYVSKMLGTGKLGFSFVRITALMVSC 
SEQ ID NO: of nucleotide sequence

SEQ ID NO: of peptide sequence

Method

SEQ ID NO: in USSN 09/496,914

Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence

Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence

Amino acid sequence (A=Alanine C=Cysteine,
D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine,
I=Isoleucine, K=Lysine, L=Leucine,
M=Methionine, N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine, W=Tryptophan,
Y=Tyrosine, X=Unknown, *=Stop codon,
/=possible nucleotide deletion, \=possible
nucleotide insertion














NRLWVGTGNGVnSIPLTETVILHQGRLLGLR 
ANKTSGVPGNRPGSVlRVYGD»fSDKVTPGT 
FIPYCSMAHAQLCFHGHRDAVKFFVAVPGQV 
ISPQSSSSGTDLTODKGROHLHRSLWRRP 


805


2155


A 


6605 


469 


2602 


FGBILLWGTAFKSWKMKAPIPHLILLYATFTQ 

SLKWTKRGSADGCTDWSIDIKKyQVLVGEP 

VRIKCALFYGYIRTNYSLAQSAGLSLMWYKS 

SGPGDFEEPIAFDGSRMSKEEDSIWFRPTLLQ 

DSGLYACVIRNSTYCMKVSISLTVGENDTGL 

CWSKMKYFEKAELSKSKEISCRDIEDFLLPT 

REPEILWYKECRTKTWKPSrVFKRDTLLIREV 

REDDIGNYTCELKYGGFWRltTTELTVTAPL 

TDKPPKLLYPMESKLTIQETQLGDSANLTCRA 

FPGYSGDVSPLJYWMKGEKFIEDLDENRVWE 

SDWOLKEHLGEQEVSISLIVDSVEEGDLGNYS 

CYVENGNGRKHASVLLHKREIMYTVELAGG 

LGAIUXLVCLVTIYKCYKIEIMLFYKNHFGA 

EELDGDNKDYDAYLSYTKVDPDQWNQETGE 

EERFALEILPDMLEKHYGYKLFIPDRDLIPTGT 

YIEDVARCVDQSKRLnVMTPNYWRRGWSIF 

ELETRLKNMLVTGBIKVILIECSELRGIMNYQE 

VEALKHTIKLLTVIKWHGHCCNKLNSKFWKR 

LQYEMPFKRIEPITHEQALDVSEQGPFGELQT 

VSAISMAAATSTALATAHPDLRSTFHNTYHS 

QMRQKHYYRSYEYDVPPTGTLPLTSIGNQHT 

YCNIPMTLINGQRPQTKSSREQNPDEAHTNSA 

ILPLLPRETSISSVIW 


806 


2156


A 


6614 


3 


1384 


NSARGGVGVRGARAMATVQEKAAALNLSAL 

HSPAHRPPGFSVAQKPFGATYVWSSUNTLQT 

QVEVKKIUIHRLKRHNIXTVGSEAVDVIFSHL 

IQNKYFGDVDIPRAKWRVCQALMDYKVFE 

AVPTKVFGKDKKFTFEDSSCSLYRFTTIPNQD 

SQLGKENKLYSPARYADALFKSSDmSASLJED 

LWENLSLKPANSPHVNISTTLSPQVINEVWQE 

ETIGRLLQLNrDLPLLDSlXKQQEAVPKIPQPK 

RQSTMVNSSNYUJRGILKAYSDSQEDEWLSA 

AIDCLEYLKDQMWEISRSFPEC^DRTDLVKE 

LLfDAIGRYYSSREPLLNHLSDVHNGIAELLV 

NGKTHALEATQLLLKLLDFQNREEFKRLLYF 

MAVAANFSEFKLQKESDNRMWKRIFSKAIV 

DNKNLSKGKTDLLVLFLVMDHQKDVFKIPOT 

LVHiaVS\VK\LMAIQNGRDPNRDAGYIYCQRI 

DQRDYSNIIEKITIDEIXYLLKTLDEDSKLSA 

KEKKKMXGQFYKCHPDIFIEHFGD 


807 


2157 


A 


6613 


4198 


2094 


FGIVGITAUfI'DIiLDSDRDPAIFSLCDFGAMR 

PQH-TJ .1 .Al .T .TLGLAAQHQDKVPCaCM/VKML 

CPDRVDKKVSC3QVLGLLQVPSVLPPDTET1J> 

LSGNQLRSILASPLGFYTALRHLDLSTNBISFL 

QPGAFQALTHLEHLSLAHNRLAMATALSAG 

GLOPLPRVTSLDLSGNSLYSGLLERliOEAPS 

LHTLSLAENSLTRLTRHTFRDMPALEQIDLHS 

NVI^IEDGAFEGLPRLTHLNLSRNSLTaSD 

FSLQQLRVLDLSCNSIEAFQTAS\QPQAEFQLT 

WIDUIENKLLHFPDLAALPRLIYLNLSNNLIR 

LPTGPPQDSKGIHAPSEGWSALPLS\APSGNAS 

GRPLSQLLNLDLSYNEIHJPDSFLEHLTSLCFL 

NLSRNCLRTFEARRLGSLPCLMLLDLSHNALE 

TLELGARALG\SLRTLLLQGNAUCDLPPYTFA 

NLASLQRLNLQGNRVSPCGGPDEPGP\SGCV\ 

AFSGrrSLRSLSLVDNBIHLLRAGAFLHrPLTE 

LDLSSNPGLEVATGALGOLEASLEVLALQGN 

GUm,QVDLPCFlCLKRLNLAENRLSHLPAW 

TQAVSLEVLDLRNNSFSLLPGSAMGGLETSLR 

RLYLQGNPLSCCGNGWLAAQLHQGRVDVDA 

TQDUC31FSSQEEVSLSHVRPEDCEKGGLKNI 

NLIELTFILVSAnXTn-AACCX^VRRQXFNQQ 

YKA 


808 


2158 


A 


6619 


153 


1852 


FKAI^YIYTNimEREAAFEVAILLRRMEEG 

ARHKNNTEKKHPGGGESDASPEAGSGGGGV 

ALKKEIGLVSACGIIVGNIIGSGIFVSPKGVLEN 

AGSVGLALIVWrVTGFITVVOALCYAELOVNI 

PKSGGDYFYVKDIFGGLAGFLRLWIAVLVIYP 

TNQAVIALTFSNYVLQPLFPTCFPPESGUUiA 

AICLLLLTWVNCSSVRWATRVQDIFTAGKLL 

ALAUIlMGrVQICXGEym^KNAEENFQEP 

DIGLVALAFLQGSFAYGGWNFLNYWTEELV 

DPVYKNLVPRAIFISIPNLVTFVYVFANV/ALYVT 

AMSPQEL\LASVNAVAVTFGEKLLGVMAWIM 

PISVALSTFGGVNGSLFTSSRLFFAGAREGHLP 

SVLAMIHVKRCTPIPALLFTaSTLLAILVTSD 

MYTLINYVGFINYLFYGVTVAOQIVLRWKKP 

DIPRPKINLLFPIIYLLFWAFLLVFSLWSEPVV 

CGIGLADviLTGVPVYFLOVyWQHKPKCFSDn 

ELLTLVSQKMCVWYPEVERGSGTEEANED 

MEEQQQPMYQPTPTKDKDVAGQPQP 


809 


2159 


A 


6621 


1041 


223 


QDSRKMLPSTSVNSLVQGNGVLNSRDAARH 

TAGAKRYKYLRRLFRFRQMDFEFAAWQMLY 

LFTSPQRVYRNFHYRKQTKDQWARDDPAFL 

VLLSnVLCVSTIGFOFVLDMOFFETIKLLLWV 

VLIDCVGVGLLIATLMWnSNKYLVKRQSRD 

YDVEWGYAFDVHLNAFYPULVILHnQLFFIN 

HVILTDTnGYLVGNTLWLVAVGYYIYVTFL 

GYSVGLLFFSULmJCNTVILLYPFAPLILLYG 

LSLALGWNFTHTLCSFYKYRVK 


810 


2160 


A 


6633 


160 


822 


SPASOHCRLNGAAVAMFGCLVAGRLVQTAA 
QQVAEDKFVFDLPDYESINHWVF*fl,GTIPFP 
EGMGGSVYFSYPDSNGMPVWQLLGFVTNGK 
PSAIFKISGLKSGEGSQHPFGAMNIVRTPSVAQ 
IGISVELLDSMAQQTPVONAAVSSVDSFTQFT 
QKMLDNFYNFASSFAVSQ/VPDDTQ/RPSHvIF 
IPANWLKWYENFQRRTSTEPSLLENIIWDaN 
F 


811 


2161 


A 


6627 


18 


3367 


LEGSLNTERAKYYLTnMHHl'TVlXVEDPEEG 

AAASISQEPSLADKARIQDSDEPDLSQNSrro 

EHSQLLDDGHKKARNAYLNNSNYEEODEYF 

DKNIj\Ij™ENnyna'KVSSIXNRMANYTNLT 

QGAKEHEEAENITEGKKKPTKTPQMGTFMG 

VYLPCLQNIFGVILFLRLTWWGTAGVLQAF 

AIVLICCCCTMLTAISMSAIATNGWPAGGSY 

FMISRALGPEFGGAVGLCFYLG1TFAAAMYIL 

QAIEIFLVYIVPRAAIFHSDDALKESAAMLNN 

MRVYGTAFLVLMVLWnGVRYVNKFASLFL 

ACVTVSILAIYAGAIKSSFAPPHFPVCMLGNRT 

LSSRHIDVCSKTKEINNMrVPSKLWGFFCKSS 

QFFNATCDEYFVHNNVTSIQGIPGLASGirrEN 

LWSNYLPKGEIIEKPSAKSSDVLGSIJfflEYVL 

WrrrSFTLLVOIFFPSVraiMAOSNRSGDLKD 

AQKSfflGTILAlLTTSFVYLSNWLFGACIEGV 

VLRDKFGDAVKGNLWGTLSWPSPWVIVIGS 

FFSTCGAGLQSLTGAPRLLQAIAKDNHPFLRV 

FGHSKANGEPTWAUXTAAIAELGILIASLDL 

VAPE-SMFFLMCYLFVNLACALQTLLRTPNW 

RPRFRYYHWALSFMGMSICLALMFISSWYYA 

rVAMVIAGMTYKYIEYQGAEKEWGDGIRGLS 

LSAARFALLRLEEGFPHTKNWRPQLLVLLKL 

DEDLHVKHPRLLTFASQLKAOKOLTIVGSVIV 

GNFI^NYGEALAAEQTKHLMEAEKVKGFCQ 

LVVAAKLKEGISHUQSCGLGOMKHNTVVM 

GWPNGWRQSEDARAWKmGTVRVTTAAHL 

ALLVAKNISFFPa^VEQFSEGNIDVWWIVHDG 

GMLMLLPFLLK\QHKVWRKCSIRFF\TVAQLE 

DNSIQMKKDLATFLYHLRIEAEVEWEMHDS 

DISAYTYERTLN4MEQRSQMLRHMRLSKTER 

DREAQLVKDRNSMLRLTSIGSDEDEETETYQ 

EKVHMTWTKDKYMASRGQKAKSMEGFQDL 

LNMRPDQSNVRRMHTAVKLNEVIVNKSHEA 

KLVLLNMPGPPRNPEGDENYMEFLEVLTEGL 

ERVLLVRGGGSEVmYS 


812 


2162 


A 


6628 


66 


640 


AVCl'MSEMAELSELYEESSDLQMDVMPGEG 

DLPQMEVGSGSRELSLRPSRSGAQQLEEEGP 

MEEEEAQPMAAPEGKRSLANGPNAGEQPGQ 

VAGADFESEDEGEEFDDWEDDYDYPEEEQLS 

GAGYRVSAALEEADKMFLRTREPALDGQFQ 

MHYEKTPFDQLAFIEELFfiLMVVNRL'reELG 

CDEIIDRE 


813 


2163 


A 


6630 


708 


1355 


AKMGAYKYIQELWRKKQSDVMRFLLRVRC 

WQYRQLSALHRAPRPTRPDKARRLGYKAKQ 

GY/VYIYIGFVFAVIYRIRVRRGGRKRPVPKO 

ATYGKPVHHGVNQLKFARSLQSVAEERAGR 

HCGALRVLNSYWVGEDSTYKFFEVILIDPFHK 

AIRRNPDTQWrnCPVHKHREMRGLTSAGRKS 

RGLGKGHKFHHTIGGSRRAAWRRRNTLQLH 

RYR 


814 


2164 


A 


6635 


201 


1705 


KOTEMNKSRWQSRRRHGRRSHQQNPWFRLR 

DSEDRSDSRAAQPAHDSGHCaJDESPSTSSGT 

AOTSSVPELPGFYFDPEXKRYFRLLPGHNNCN 

FLTKESIRQKEMESKRLRLLQEEDRRKXIARM 

GFNASSMLRKSQIXSFU^VTNYCHLAHELRLS 

CMKUCKVQIRSMDPSALASDRFNLILAIXrNS 

DRLFTVNDVTVQOSKYGIINLQSLKTITLKVF 

hfflENLYFTNRKVNNSVCWASLNHLDSHILLC 

LM(a.AETPGCATLLPASLFVNSHPAGIDRPG\ 

MLCSFRIPGAWSCAWSLNIQANNCFSTGLSR 

RVLLTNWTGHRQSFGTNSDVLAQQFALMA 

PLLFNGCRSGEIFAIDLRCGNQGKGWKATRLF 

HDSAVTSVRILQDEQYLMASI»1AGKIKLWD 

LRTTKCVRQYEGHVNEYAYLPLHVHEEEGIL 

VAVGQDCYTRIWSLHDARLXRTIPSPYPASKA 

DIPSVAFSSRLGGSRGAPGLLMAVGQDLYCY 

SYS 


815 


2165 


A 


6643 


659 


3282 


NKNBLEVPSARTTRIMGDHLDliLGWLMAG 

PVFOIPSCSFIXjRIAFYRFCNLTQVPQVLKrTE 

RLLLSFNYIRTVTASSFPFLEQLQLLELGSQYT 

PLTTOKEAFRNLPNLRILDLGSSKIYFLHPDAF 

QGLFHLFH-RLYFOH^DAVLKDGYFRNLKA 

LTRLDLSKNQIRSLYLHPSFGKLNSLKSIDFSS 

NQIFLVCHIELEPLQOKTLSFFSLAANSLYSR 

VSVDWGKCMNPFRNMVLEILDVSGNGWrV 

DITGNFSNAISKSQAFSLILAHHIMGAGFGFHN 

IKDPDQNnFAGLARSSVRHLDLSHGFVFSLNS 

RVFETLia)lJCVLJ>4LAYhIKINKIADEAFyGLD 

NLQVLNLSYNLLGELYSSNFYGLPKVAYIDL 

QKNHL\nQDQTFKnJEKLQTLDLRDNALTnH 

FIPSIPDIFLSGNKLVTLPKINLTANLIHLSENR 

LENLDILYFLLRVPHLQILILNQNRFSSCSGDQ 

TPSENPSLEQLFLGENMLQLAWETELCWDVF 

EGLSHLQVLYLNHNYLNSLPPGVFSHLTALR 

GLSLNSNRLTVLSHNDLPANLEILDISRNQLL 

APNPDVFVSLSVLDITHWKFICECELSTFINWL 

GCDEEEVIJCSLKFSLFWCrrVTLTLFLMTILTV 

TKFRGFCFICYKTAQRLVFKDHPQGTEPDMY 

KYDAYLCFSSKDFTWVQNALLKHLDTQYSD 

QNIUT^LOTEJaRDFVPGENRPVANIQDAIWNSR 

KIVCLVSRHFLRDGWCI£AFSYAQGRCLSDL 

NSALIMVWGSLSQYQLMJCHQSIRGFVQKQQ 

YLRWPEDLQDVGWFLHKLSQQILKKEKEKK 

KDNNIPLQTVATIS 


816 


2166 


A 


6646 


1 


3811 


RDRAGVRPAGKQHAAAAFYDVGGDRPWDS 

GNTQLPPRNPVKANAMFGAGDEDDTDFLSPS 

GOARLASLFGLDQAAAOHGNEFFQYTAPKQP 

KKGQGTAATGNQATPKTAPATMSTPmVAT 

AVHAYRYTNGQYVKQGKFGAAVLGNHTTR 

EYKILLYISQCJQPVTVARIHVNFELMVRPNNY 

STFYDDQRQNWSIMFESEKAAVEFNKQVCIA 

KCNSTSSLDAVLSQDLIVADGPAVEVGDSLE 

VAYTGWLFQNHVLGQVFDSTANKDKLLRLK 

LGSGKVIKGWEDGMLGMKKGGKRLLIVPPA 

CAVGSEGVIGWTQATDSILVFEA^VRRVKIA. 

KDSGSDGHSVSSRDSAAPSPIPGADNLSADPV 

VSPPTSIPFKSC^ALRTKSNSLSEQLAINTSPD 

AVKAKLISRMAKMGQPMLPILPPQLDSNDSEI 

EDVNTLQGGGQPWTPSVQPSLQPAHPALPQ 

MTSQAPQPSVTOLQAPSAALMQVSSLDSHSA 

VSGNAQSFQPYAQMQAYAYPQASAVTSQLQ 

PVRPLYPAPLSQPPHFQGSGDMASFLMTEAR 

QHNTEtRMAVSKVADKMDHLMTKVEELQKH 

SAGNSMLIPSMSVTMETSMIMSNIQRnQENER 

LKQEILEKSNRIEEQNDKISEUERNQRYVEQS 

NIA4MEKRNNSLQTATENTQARVLHAEQEKA 

KVTEELAAATAQVSHLQUCMTAHQKKETEL 

QMQLTESLKETDLLRGQLTKVQAKLSELQET 

SEQAQSKFKSEKQNRKQLELKVTSLEEELTDL 

RVEKESLEKNLSERKKKSAQERSQAEEEIDEI 

RKSYQEELDKLRQLLKKTRVSTDQAAAEQLS 

LVQAHLQTQWEAKCEHLLASAKDEHLQQYQ 

EVCAQRDAYQQKLVQLQEKSVCFA\CLALQA 

QITALTKQNEQHIKELEKNKSQMSGVEAAAS 

DPSEKVKKIMNQVFQSLRREFELEESYNGRTI 

LGTIMNTIKMVTLQLLNQQEQEKEESSSEEffi 

EKAEERPRRPSQEQSASASSGQPQAPLNRERP 

ESPMVPSEQWEEAVPLPPQALTTSQDGHRR 

KGDSEAEALSEKDGSLPPELSCIPSHRVLGPP 

TS1PPEPLGPVSMDSECEESLAASPMAAK\PDN 

PSGK\VCVSBVAPDGPLQESSTRLSLTS\DPEE 

GDPLALGPESPGEPQPPQLKKDDVTSSTGPHK 

ELSSTEAGSTVAGAALRPSHHSQRSSLSGDEE 

DELFKGATLKALRPKAQPEEEDEDEVSMKGR 

PPPTPLFGDDDDDDDIDWLG 


817 


2167 


A 


6649 


63 


1073 


FFRSSSDNGSPIRQYE/HSTPAHQGPVMGLEG 

KS/AKNSQLRIVLVGKTGAGKSATGNSILGKK. 

VEHSGTAAKSmCKCEKRSSSWKETELVWD 

TPGIFDTEVPNAETSKEIIRCILLTSPGPHALLL 

WPLGRYTEEEHKATEKILKMFGERARSFMIL 

IFTRKDDLGDTNLHDYLRBAPEDIQDLMDIFG 

DRYCALNNKATGAEQEAQRAQLLGUQRW 

RENKEGCYTNRMTQRAEEEIQKQTQAMQEL 

HRVELEREKARIREEYEEKIRKLEDKVEQEKR 

KKQMEKKLAEQEAHYAVRQQRARTEVESKD 

GELELIMTALQIASFILLRLFAED 


818 


2168 


A 


6660 


357 


1890 


APSGSWTRWLTLDPCSLRSRSPRSLLDPGMP 

GISARGLSHEGRKQLAVNLTRVLALTOSILDA 

YIIEFFMDNLWDTLPCSWQEALDGLKPPQLA 

TMLXGMPGEGEVVRYRSVWPLTLLALKSTA 

CAtAFTRMPGFQTlOTFLENPSQSSRLTAPFR 

KHVRPKKQHEIRRLGELVKKLSDPT/GLHPGC 

RRGLRPG\HLSRFMALGL<HJVIVKS1EGDQRL 

A^ERAQKLDQELLQALEKEEKKNPQWQTSPR 

HSPHHVVRWVDFTALCEELLLPLENPCQGRA, 

RLLLTGLHACGNDLSVAIXRHFSCCPEWALA 

SVGCCYMKLSDPGGYPLSQWVAGLPGYELP 

YRLREGACHALEEYAERLQKAGPGLRTHCY 

RAALBTVIRRARPELRRPGVQGIPRVHELiaEE 

YVQRGLQRVGU3PQU»LNLAALQAHLAQEN 

RWAFFSLALLLAPLVETLILLDRLLYLQEQA 

LSPVGFHAELLPIFSreLSPRNLVLVATKMPLG 

QALSVLETEDS 


819 


2169 


A 


6661 


65 


2686 


SGSGHCLAEAASMGPWGWKLRWTVALLLA 

AAGTAVGDRCHINEFQCQDGKCISYKWVCD 

GSAECQDGSDESQETCLSVTCKSGDFSCGGR 

VNRCIPQFWRCDGQVDCDNGSDEQGCPPKTC 

SQDEFRCHDGKCISRQFVCDSDRDCLDGSDE 

ASCPVLTCGPASFQCNSSTCIPQLWACDNDPD 

CH5GSDEWPQRCR0LYVFQGDSSPCSAFEFH 

CLSGECIHSSWRCDGGPDCKDKSDEe«ICAVA 

TCRPDEFQCSDGNC3HGSRQCDREYDCKDMS 

DEVGCVNVTLCEGPNKFKGHSGECTTLDKVC 

NMARDCRDWSDEPDCECGTNECLDNNGGCS 

HVCNDLKIGYECLCPDGFQLVAQRRCEDIDE 

CQDPDTCSQLCVNLEGOYKCQCEEGFQLDPH 

TKACKAVGSIAYLFFrNRHEVRKMTLDRSEY 

TSUPNLRNVVALDTEVASNRIYWSDLSQRMI 

CSTQLDRAHGVSSYDTVISRDIQAPDGLAVD 

WMSNIYWTDSVUm^SVADTKGVKRKTLFR 

ENGSKPRAIWDPVHGFMYWTDWGTPAKK. 

KGOLNOVDIYSLVTENIQWPNOITLDLLSGRL 

YWVDSKUISISSIDVNGGaWKTILEDEKRLAH 

PFSLAVFEDKVFWTDIINEAIFSANRLTGSDV 

NLLAENLLSPEDMVLFHNLTQPRGVNWCERT 

TLSNGGCQYLCLPAPQINPHSPKFrCACPDGM 

LLAR\DMRSCLTEa\EAAVATQETSTVRLKVS 

STAWTQHTTTRPVPDTSRLPGATPGLTTVH 

VrMSHQALQDVAG\RGN\EKKPSSVRALSIVL 

PIVUiVFLCLGVFLLWKNWRLKNINSINFDNP 

VYQKTTEDEVHICHNQDGYSYPSRQMVSLED 

DVA 


820 


2170 


A 


6666 


17 


4146 


ERGISSQIKGMKSOSGGOSPTSLWGLLFLSAA 
LSLWPTSGEICGPGIDIRNDYQQLKRLENCTVI 
EGYLHILUSKAEDYRSYRFPKLTVITEYLLLF 
RVAGLESLGDLFPNLTVIRGWKLFYNYALVIF 

EMTNLKDIC5LYNLRNlTRG\AnUEKNADLCYL 

STVDWSLILDAVSNNYTVGNKPPKECGDLCP 

GTMEEKPMCEKTTINNEYNYRCWTTNRCQK 

MCPSTCGKRACTENNECCHFECLGSCSAPDN 

DTACVACRHYYYAGVCVPACPPNTYRFEaw 

RCVDRDFCANILSAESSDSEGFVIHDGECMQE 

CPSGFIRNGSQSMYCIPCEGPCPKYCEEEKKT 

KTIDSVTSAQMLQGCnFKGNLLINIRRGNNIA 

SELENFMGLIEWTGYVKIRHSHALVSLSFLK 

NLRLILGEEQLEGNYSFYVLDNQNLQQLWD 

WDHRNLTIKAGKMYFAFNPKLCVSEIYRMEE 

VTGTKGRQSKGDINTRNNGERASCESDVLHF 

T SlTl'SKNRIirrWHRYRPPDYRDLISFTVYYK 

EAPFKNVTEYDGQDACGSNSWNMVDVDLPP 

NKDVEPGILLHGLKPWTQYAVYVKAVTLTM 

VENDHDRGAKSEILYIRTNASVPSIPLDVLSAS 

NSSSQLIVKWNPPSLPNGNLSYYIVRWQRQP 

QDGYLYRHNYCSKDKIPIRKYADGTIDIEEVT 

ENPKTEVCGGEKGPCCACPKTEAEKQAEKEE 

AEYRKVFENFLHNSIFVPRPERKRRDVMQVA 

NTTMSSRSRNTTAADTYNITDPEELBTEYPFF 

ESRVDNKERTVlSNLRPFTLYRIDIHSCafflEAE 

KLGCSASNFVFARTMPAEOADDEPGPVTWEP 

RPENSIFLKWPEPENPNGLILMYEIKYGSQVE 

DQRECVSRQEYRKYGGAKLNRLNPGNYTARI 

QATSLSGNGSWTDPVFFYVQAKRYENFmui 

ALPVAVLLIVGGLVnULYVFHRKKNNSRLGN 

GVLYASVNPEYFSAADVYVPDEWEVAREIOT 

MSRELGQGSFGMVYEGVAKGWKDEPETRV 

ADCTVNEAASMRERIEFLNEASVMKEFNCHH 

WRLLGWSQGQPTLVIMELMTRGDLKSYLR 

SUIPEMENNPVLAPPSLSKMIQMAGEIADGM 

AYLNANKFVHRDLAARNCMVAEDFTVKIGD 

FGMTRDIYETDYYRKGGKGLLPVRWMSPESL 

KDGVFTTYSDVWSFGWLWEIATLAEQPYQ 

OLSNEQVLRFWMEGOLLDKPDNCPDMLFHL 

MRMCWQYNPKMRPSFLEnsSIKEEMEPGFRE 

VSFYYSEENKLPEPEELDLEPENMESVPLDPS 

ASSSSLPLPDRHSGHKAENGPGPGVLVLRASF 

DERQPYAHMNGGRKNERALPLPQSSTC 


821 


2171


A 


D07I 




82S 


GRVLFRGCGVGHKGQVLMOTFILAQDWLSE 

SNHVFCVSSMLRLQKRLASSVLRCGKKKVW 

IJ3Pl>nETNEIANANSRQQIRKUKDGLIIRKPVT 

VHSRARCRKNTTLARRKGRHMGIGKRKGTAN 

ARMPEKVTWMRRMRILRRLLRRYRES/KRYR 

ESKKIDRHMYHSLYLKVKGNVFKNKRILMEH 

nmJKADKARKKLLADQAEARRSKTKEARK 

RREERLQAKKBEIIKTLSKEEETXK 


822


2172


A 


0/ 1j 




21 


DFRPGLLLPRKXKMFGFHKPKMYRSJKGCVCI 
SGAKSSSSVRFTDSKRYEiaDFQ\SCFGLHETR\ 
SGDI\CNA\CVU-\LKRWKKLPAGSKKVNWNH 
VVDARAGPS\LKTrLKPKKVKTL\SGNRIK\ST 
OISKLOKEFKRVHNSDAHSTTS\SASP\Ansi>I R 

TVNQFRWTGSDTGVGFPGSNBNHPVFSFLDLX 
TYWKRQKICCGIMYKGRFGEVUDraLFKPCC 
SNKKA\AAEKPEEQGPEPLPISTQEWVTEVFM 


823 


2173 


A 


6727 


3 


4063 


PYLATLQLDSSLLIPPKYQTPPAAAQGQATPQ 
NAGPLAPNGSAAPPAOSAFNPTSNSSSTNPAA 
SSSASGSSVPPVSSSASAPGISQISTTSSSGFSGS 
VGGQNPSTGGISADRTQGNIGCGGDTDPGQS 

SSQPSQDGQESNVPSVGSLADPDYLNTPQMN 

TPVTLNSAAPASNSGAGVLPSPATPRFSVPTP 

RTPRTPRTPRGGGTASGQGSVKYDSTDQGSP 

ASTPSTTRPLNSVEPATMQPIPEAHSLYVTLIL 

SDSVMNIFKDRNFDSCaCACNMNIKGADVO 

LYIPDSSNHDQYRCTCGFSAIMNRKLGYNSGL 

FLEDELDIFGKNSDIGQAAERRIivlVMCQSTFL 

PQVEOTKKPQEPPISLLLLLQNQHTQPFASLN 

FLDYISSNNRQTLPCVSWSYDRVQADNNDY 

WTECFNALEQGRQYVDNPTGGKVDEALVRS 

ATVHSWPHSNVUJISMLSSQDWRMLLSLQP 

FLQDAIQKKRTGRTWENIQHVQGPLTWQQFH 

XMAGRGTYGSEESPEPLPIPTLLVGYDKDFLT 

ISPFSLPFWERLLLDPYGGHRDVAYIWCPEN 

EALLEGAKTFFRDLSAVYe^iCRLGQHKPICK 

VLRDGIMRVGKTVAQKLTDBLVSEWFNQPW 

SGEENDNHSRLKLYAQVOIHHLAPYLATLQL 

DSSLLlPPkYQTPPAAAQGQATPGNAGPLAPN 

GSAAPPAGSAFNPTSNSSSTNPAASSSASGSSV 

PPVSSSASAPOISQISTTSSSGFSGSVGGQNPST 

GGISADRTQGNIGCGGDTDPGQSSSQPSQ0O 

QESVTERERIGIPTEPDSADSHAHPPAWrYM 

VDPFTYAAEEDSTSGNFWLLSLMRCYrEMLD 

NLPEHMKNSFUXJIVPCQYMLQTMKDEQVFY 

IQYLKSMAFSVYCQCRRPLPTQIHIKSLTGFGP 

AASIEMTLKNPERPSPIQLYSHTILAPIKDKQT 

ELGETFGEASQKYNVLFVGYCLSHDQRWLL 

ASCTDLHGELLETCWNIALPNRSRRSKVSAR 

KIGLQKLWEWCIGIVQMTSLPWRWIGRLGR 

LGHGELKDWSILLGECSLOnSKiaJCDVCRM 

CGISAADSPSILSACLVAMEPQGSFWMPDAV 

TMGSVFGRSTALNMQSSQLNTPQDASCTHIL 

VFPTSSnQVAPANYPNEDGFSPNNDDMFVDL 

PFPDDMDNDIQILMTGNLHSSFNSSPVPSPGSP 

SGIGVGSHFQHSRSQGERLLSREAPEELKQQP 

LALGYFVSTAKAENLPQWFWSSC3»QAQN\QC 

PLFLKASLHHHSVAQTDELLPARNSQRVPHP 

IJOSKTreOVLRFVLEQYNALSWLTCNPATQD 

RTSCXPVHFWLTQLYNAIMNIL 


824 


2174 


A 


6732 


2440 


36S 


VEEGLQRRRTPPGGRRGPVTPARPGPDSVRR 

RLLPPSSAAAFSSHRHNLLCSRRRGGGGGGG 

GGGGGTKRPGITGPTAATSPSGEPGNAASAP 

LSLLSPFPGQTTYQHPGVAEPSAYGGR0VAC 

ASLVFGRLQHRGGDRKRGLLGRSSGDAASD 

QPFRCRSQSTAORLVKQMDFTEAYADTCSTV 

GLAAREGNVKVLRKLLKKGRSVDVADNRQ 

WMPEHEAAYHNSVECLQMLINADSSENYIKM 

KTFEGFCALHLAASQGHWKIVQILLBAGADP 

NATTLEETTPLFLAVENGQIDVIRLLLQHGAN 

VNGSHSMCXSWNSLHQASFQENAEIIKLLLRK 

GANKECQDDFGrrPLFVAAQYGVKLESLXSDLIS 

SGVANVNCQALDKATPLFIAAQEGHTKCVELL 

LSSGADPDLYCNEDSWQLPMAAAQMGHTKl 

LDLLIPLTNRACDTGLNKVSPVYSAVFGGHE 

DCLEILLRNGYSPDAQACLVFGFSSPVCMAFQ 

KIXXFFGIVNILLKYGAQINELHLAYCLKYEK 

FSIFRYFLRKGCSLGPWNHIYEFVNHAIKAQA 

KYKEWLPHLLVAGFDPULLCNSWIDSVSIDT 

UFTLEFTNWKTLAPAVERMLSARASNAWIL 

QQHIATVPSLTHLCRLEIRSSLKSERLRSDSYIS 

QLPLPRSLHNYLLYEDVLRMYEVPELAAIQD 
G 


825 


2175 


A 


6735 


277 


1252 


RIMGLFDRGVQMLLTTVOAFAAFSLMTIAVG 

TDYWLYSRGVCKTKSVSENETSKKNEEVMT 

HSGLWRTCXXEGNFKGLCKQIDHFPEDADYE 

ADTAEYFLRAVRASSIFPILSVILLFMQGLCIA 

ASEFYKTRHNIILSAGIFFVSAGLSNnGIIVYIS 

ANAGDPSKSDSKKNSYSYGWSFYFGAJLSFIIA 

EMVGVLAVHMFIDRHKQLRATARA\TT>YLQ 

ASAITRIPSYRYRYQRRSRSSSRSTEPSHSRDA 

SPVGIKGFNTLPSmSMYTLSRDPLKAATTPT 

ATYNSDRDNSFLQVHNCIQKENKDSLHSNTA 

NRRTTPV 


826 


2176 


A 


6744 


3 


5177 


SDDLRTGLFQDVQDAESLKLPGVYEVLFYNB 

TEDCPGMMLWRYPEPRGLTLVRITPVPFNTT 

EDPDISTADLGDVLQDPCSLEYWDELQKVFV 

AFREFNLSESKVCELQLPDINLVNDQKKLVSS 

DLWRIVLNSSQNGADDQSSASESGSQSTCDPL 

VITTALAACTRVDSCFTPWFVPSLCVSFQFAH 

LHTtlLCHHLDQLGTAAPQ'yLQJTVSDRNMPS 

ELEYMIVSFREPHMYLRQWNNGSVCQEIQFL 

AQADCKLI^CRNVTMQSWKPFSIFGQMAVS 

SDVVEKLLDCTVIVDSVFVNLGQHVVHSLNT 

AIQAWQQNKCPEVEELVFSHFVICNDTQETL 

RFGQVDTDENDLLASLHSHQYSWRSHKSPQL 

LHICIEGWGNWRWSEPFSVDHAGTFIRTIQYR 

GRTASLinCVQQLNGVQKQinCGRQnCSYLSQ 

SIELKWQHYIGQDGQAWREHFDCLTAKQK 

LPSYILENNELTELCVKAKGDEDWSRDVCLE 

SKAPEYSIVIQVPSSNSSnYVWCTVLTLEPNS 

QVQQRMIVFSPLFIMRSHLPDPniHLEKRSUjL 

SETQDPGKGQEKPLQNIEPDLVHHLTFQAREE 

YDPSDCAVPISTSLKQIATKVHPGGTVNQILD 

EFYGPEKSLQPIWPYNKKDSDRNEQLSQWDS 

PMRVKLSIWBCPYVRTLLIELLPWALLINESKW 

DLWLFEOEKIVLQVPAGKIIIPPNFQEAFQIGIY 

WANTNTVHKSVAIKLVHNLTSPKWKDGGNG 

EWTLDEEAFVDTEIRLGAFPGHQKLCQFCIS 

SMVQQGIQIIQIEDKTTIINNrPYQIFYKPQLSV 

CNPHSGKEYFRVPDSATFSICPGGEQPAMKSS 

SLPCWDLMPDISQSVLDASLLQKQIMLOFSPA 

PGADSSQCWSLPATVRPEFPRQSVAVPLGNFR 

ENGFCTRAIVLTYQEHLGVTYLTLSEDPSPRV 

nHNRCPVKMLIKENIKDIPKFEVYCKKlPSECS 

IHHELYHQISSYPDCKTKDLLPSLLLRVEPLDE 

VTTEWSDAIDINSQGTQWFLTGFGYVYVDV 

VHQCGTVFnVAPEOKAOPILTT«m>JRAPEKIV 

TT/KMFITQLSLAVFDDLTHHKASAELLRLTL 

DNIFLCVAPGAGPLPGEEPVAALFELYCVEIC 

CGDLQLDNQLYNKSNFHFAVLVCQGEKAEPl 

QCSKMQSLLISNKELEEYKEKCFIKLCrrLNEG 

KSILCDINEFSFELKPARLYVEDTFVYYIKTLF 

DTYLPNSRLAGHSTHLSGGKQVLPMQVTQH 

ARALVNPVKLRKLVIQPVNLLVSIHASLKLYI 

ASDHTPLSFSYFERGPIFTTARQLVHALAMHY 

AAGALERAGWWGSLDILGSPASLVRSIGNG 

VADFFRLPYEGLTRGPGAFVSGVSRGTTSFVK 

HISKGTLTSrrNLATSLARNMDRLSLDEEHYN 

RQEEWRRQLPESLOEQLRQOLSRLGISLLOAI 

AGIVDQPMQNFQKTSEAQASAGHKAKGVISG 



WO 01/57188

PCT/US01/03800




VGKGJMOVKrKPIGGAAELVSQTGYOILHGA 

GLSQLPKQRHQPSDWHADQAPNSUVKYVW 

KMLQSLGRPEVHMALDWLVRGSGQEHEGC 

LLLTSEVLFWSVSEDTQQQAFPVTEIDCAQD 

SKQNNLXTVQLKQPRVACaJVEVDGVRBRLSE ■ 

QQYNRLVDYTTKTSCHLAPSCSSMQIPCPVVA 

AEPPPSTVKTYHYLVDPHFAQVFLSKFTMVK 

NKALRKGFP 


827 


2177 


A 


6748 


2 


1662 


FVGAPRRGNPFGSPGNPGRHQGPCHRPRGTK 

ASGVSPTLWRPQAAATGLEMPSSGRALLDSP 

LDSGSLTSLDSSVFCSEGEGEPLALODCFTVN 

VGGSRFVLSQQALSCFPHTRLGKLAVWASY 

RRPGALAAVPSPLELCDDANPVDNEYFFDRS 

SQAFRYVIJHYYRTGRLHVMEQLCALSFLQEl 

QYWGIDELSXDSCCRDRYFRRKELSEIIDFKK 

DTEDQESQHESBQDFSQGPCPTVRQKLWNIL 

EKPOSSTAARIFGVlSnFVGVSnNMALMSAEL 

SWLDLOLLEILEYVCISWPTOFFVI RFI PVRn 

RaiFUUCWT^nDlXAnJFYlTLLVESLSGVSQT 

TQEL\ENVGAHCPGCLRLLRAL\RML1CAWGR 

HSTGLRSLGMTTTQCYEEVGLLLLFLSVGISIF 

STVBYFAEQSXPDTIFrSVPCAWWWATTSNfr 

TVGYGDIRPDTTTGKrVAFMCILSGILVLALPI 

AnhTORFSACyFrLKLKEAAVRQREALKKLTK 

NIATOSYISVNLRDVYARSIMEMLRLKGRER 

ASTRSSGGDDFWF 


828 


2178 


A 


6786 


5672 


1360 


GTHPASSGPVPLPPAAVSAAIREELGEPVPFV 

TASSGFQSMHSSNPKVRSSPSGNTQSSPKSKQ 

EVMVRPPTVMSPSGNPQLDSKFSNQGKQGGS 

ASQSQPSPCDSKSGGHTPKALPGPGGSMGLK 

NGAGNGAKGKGKRERSISADSaFDQRDPGTPN 

DDSDnCECNSADHDCSQDSQHTPHSMTPSNAT 

APRSSTPPHGQTTATEPTPAQKTPAKWYVFS 

TEMANKAAEAVLKGQVETIVSFHIQNISNNK 

TERSTAPLNTQISALRNDPKPLPQQPPAPANQ 

DQNSSQNTRLQPTPPIPAPAPKPAAPPRPLDRB 

SPGVENKLIPSVGSPASSTPLProGTGPNSIPN 

NRAVTPVSQGSNSSSADPKAPPPPPVSSGEPPT 

LGENPDGLSQEQLEHRERSLQTLRDIQRMLFP 

DEKEFTGAQSGGPQQNPGVLDGPQKKPEGPI 

QAMMAQSQSLGKGPGPRTDVGAPFOPQOHR 

DVPFSPDEMVPPSMNSQSGTIGPDHLDHMTP 

EQIAWLKLQQEFYEEKRRKPEQVWQQCSLQ 

DMMVHQHGPRGWRGPPPPyQMTPSEGWAP 

GGTEPFSDGINMPHSLPPRGMAPHPNMPGSQ 

MRLPGFAGMINSEMEGPNVPNPASRPGLSGV 

SWPDDVPKIPDORNFPPGQQIFSOPGRGERFP 

NPQGLSHEMFQQQLAEKQLGIJPPGMAMEGIR 

PSMEMNRMIPGSQRHMEPGNNPIFPRIPVEaP 

LSPSRGDFPKGIPPQMGPGRELEFGMVPSGM 

KGDVNLNVNMGSNSQMIPQKMREAGAGPEE 

MLKLRPGGSDMLPAQQKMVPLPFGEHPQQE 

YGMGPRPFLPMSQOPOSNSOLRNLREPIGPDQ 

RTNSRLSHMPPLPLNPSSNFTSLNTAPPVQRG 

LGRKPLDBVAGSQVHSPGINPLKSPTMHQVQ 

SPMLGSPSGNLKSPQTPSQLAGMLAGPAAAA 

SKSPPVLGSAAASPVHLKSPSLPAPSPGWTSS 

ICTPLQSPGIPPNHKAPLTMASPAMLGNVESG 

OPPPPTASQPASV>aPG\SLPSSTPYTMPPEPTL 

SQNPI^IM\MSR\MSKFAMVPS\SNPGYNHDAI 

KTVASSDDDSPPARSPNLPSMNNMPGMGINT 

QNPRISGFNPWPMPTLSPMGMTQPLSHSNQ 

MPSPNAVGPNIPPHGVPMGPGLMSHNPIMGH 

GSQEPPMVPQGRMGFP<}GFPPVQSPPQQVPFP 

HNGPSGGQGSFPGGMGFPGEGPLGRPSNLPQ 

SSADAALCKPGGPGGPDSFTVLGNSMPSVFT 

DPDUJEVIRPGATGIPEFDLSRIIPSEKPSQTLQ 

YFPRGEVPGRKQPQGPGPGFSHMQGMMGEQ 

APRMGLALPGMGGPGPVGTPDIPLGTAPSMP 

GHNPMRPPAFLQQGMMGPHHRMMSPAQST 

MPGQPTLMSNPAAAVGMIPGKDRGPAGLYT 

HPGPVGSPGMMMSMQGMMGPVNRTS 


829 


2179 


A 


6797 


433 


3 


ASFFNFSICICKHLEVGPPVOHPAHDDVGGRH 

GPGGR/GSRSPRSLQCAPGGGRRSGCPAGSSP 

ASTCPPSPGGSGADRFGPSPPPPSREAAPTAG 

AAASSTSSGASC3'PVPASSRW0VRSRTRSGSG 

OEREPRDRPSERPRLV 


830 


2180 


A 


6800 


3 


1911 


LPERAFGPRTPRAPRRKRRRLLLSPPPRPPPPL 

DREPRAPGPWLCPSRAGTAQDPARIRERRGR 

VAGGAAGPAMELRARGWWLLCAAAALVAC 

ARGDPASKSRSCGEVRQIYGAKGFSSS\DVPQ 

AEISGEHLRICPQGrrCCTSEMffiNLANRSHA 

ELETALRDSSRVLQAMLATQLRSFDDHFQHL 

LNDSERTLQATFPGAFGELYTQNARAFRDLY 

SEIJRLYYRGANlJtt,EElXAEFWARLLERLFK 

QLH?QLLLPDDYUXXGKQAEALRPF\GEAP\ 

RELRLRAT\RA\FVAAR\SFVQGLGVAS\DVVR 

KVAQVPLGVPEOSRAVIEAGSYaALHCVOVP 

GARPCPDYCRNVLKGCLANQADLDAEWRNL 

UDSMVLITDKFWGTSGVESVIGSVHTWLAEA 

INALQDNRDTLTAKVIQGCGNPKVNPQGPGP 

EEKRRRGKLAPRERPPSGTLEKLVSEAKAQL 

RDVQDFWISLPGTLCSEKMALSTASDDRCWN 

GMARGRYLPEVMQDQLANQINNPEVEVDrr 

KPDMTIRQQIMQLKIMTNRLRSAYNQNDVDF 

QDASDDGSGSGSGDGCLDDLCGRKVSRKSSS 

SRTPLTHALPGLSEQEGQKTSAASCPQPPTCL 

LPLIXFLALTVARPRWR 


831 


2181 


A 


6808 


2 


1522 


ASRHGNfTPGAIXMLLQALCPPLAPOVRGSEA 

EGRLRHKLFSOYDSSVRPAREVGDRVRVSVG 

ULAQLISLNBKDEEMSTKVYLDLEWTDYRLS 

WDPAEHDGIDSLRITAESVWIPDWLLNNND 

GNFDVALDISVWSSDGSVRWQPPGIYRSSCS 

IQVTYFPFDWQNCTMVFSSYSYDSSEVSLQT 

GLGPDOQGHQEnnHEOTFDENGQWEKIHKPS 

RLIQPPGDPRGGREGQRQEVIFYLIIRRKPLFY 

LVNVIAPCnJniAIFVFnjPPDAGEKMGLSIF 

ALLTLTVFUiLADKVPETSLSVPniKYLMFr 

MVLVTFSVILSWVLNLHHRSPHTHQMPLWV 

RQIFIHKLPLYLRLKRPKPERDLMPEPPHCSSP 

GSOWGRGTDEYFIRKPPSDFLFPKFNRFQPEL 

SAPDLRRFEDGPNRAVALLPELREWSSISYIA 

RQLQEQEDHDALKEDWQFVAMVYDRLFLW 

TFIIFTSVGTL\VIFLDATYHLPPPDPFP 


832 


2182 


A 


6824 


71 


1079 


ETMAKNPPENCEDCHILNAEAFKSKKICKSLK 
ICGLVFOILALTLrVLFWGSKHFWPEVPKKAY 
DMEHTFYSNGEKKKIYMEIDPVTRTmERSGN 
GTDETLEVHDFKNOYTGIYPVGLQKCFIKTQI 
KVIPEFSEPEEEIDENiiEl'nil'i'liQSVIWVPAE 
KPIEtWDFIJa4SKILma)NVTMYW\INPTL\IS 

GTFAKQLHHNFAFIILVSELQDFEEEGEDLHPP 

ANEKKGIEQNEQWWPQVKVEKTRHARQAS 

EEELPINDYTENGIEFDPMLDERGYCCIYCRR 

GNRYCRRVCEPLLGYYPYPYCYQGGRVICRV 

IMPCNWWVARMLGRV 


833 


2183 


A 


6846 


116 


602 


EAEOEQVCOAKCCGDAPHVENREEETARIGP 

GVMESKEERAUWUVENVNQENDEKDEKE 

QVANKGEPLALPLNVSEYCVPRGNRKRFRVR 

QPILQYRWDIMHRLGEPQARMREmMERIGE 

EVRQLMEKLREKQLSHSLRAVSTDPPHHDHH 

DEFQLMP 


834 


2184 


A 


6831 


3 


2024 


PNGVAIXHUXJAAVIPNTNYNIFQDALGGRSR 

GSREESPAPSRAPASASLWRRLWVEAKMAA 

HAAAAAQAAAAQAAHAEAADSWYLALLGF 

AEHFRTSSPPKIRLCVHCLQAVFPFKPPQRIEA 

RTHLQLGSVLYHHTKNSEQARSHLEKAWUS 

QQIPQFEDVKFEAASLLSELYCQENSVDAAKP 

LLRKAIQISQQTPYWHOUJUFQLAQLHTLEKD 

LVSACDLLGVGAEYARWGSEYTRALFLLSK 

GMLLLMBRKLQEVHPLLTLCGQIVENWQGN 

PIQKESLRVFFLVLQVTHYLDAGQVKSVKPC 

LKQLQOaQTlSTLHDDEILPSNPADLFHWLP 

KEHMCVLVYLVTVMHSMQAOYLEKAQKYT 

DKALMQLEKLKMLDCSPILSSFQVIIXEHIIM 

CaiLVTGHKATALQEISQVCQLCQQSPRLFSN 

HAAQUrrLLGLYCVSVNGMDNAEAQFTTAL 

RLTNHQELWAFIVTNLASVYIREGNRHQEWN 

LYSLLERINPDHSFPVSSHCLRAAAI'YVRGLF 

SFFQGRYNEAKRFLRETLKMSNAEDLNRLTA 

CSLVLLGHIFYVLGNHRESNNMWPAMQLAS 

KIPDMSVQLWSSALXRDLNKACGNAMDAHE 

AAQMHQNFSQQLLQDHIEACSLPEHNLrrWT 

DGPPPVQFQAQNGPNTSLASLL 


835 


2185 


A 


6855 


334 


1268 


PTRRPILPLTSPKAISVPSPLQOKQHTLVKSCL 

SVSGIOGFLVSLSSRMKLQTLAVSVTALKFWS 

AYVPCQTQDRDALRLTLEQIDLIRRMCASYSE 

LELVTSAKALNDTQKLACUGVEGGHSIDNS 

LSILRTFYMLGVRYLTLTHTCJITPWAESSAK 

GVHSFYNNISGLTDFGEKWAEMNRLGMMV 

DLSHVSDAVARRALEVSQAPVIFSHSAARGV 

CNSARNVPDDILQLLEEERWAFVMVSLFHGE 

LIQWQPKmCSTVADHFDHIKAVUGSKnGI 

GGDYDGAGKYRKKTTCKAPWRTSSRMSS 


836 


2186 


A 


6862 


315 


11 


PPRSRPSCWRKKVGPGRPWWWGGTGPPGQG 
RPEIRLLPLPMTGACGAVAASRTGSSGPG/SSL 
PNGHGGKGSGLANGLAGmGHLGLGSSFGT 
GPGSGRPPP 


837 


2187 


A 


6863 


2 


1615


VLRGQRGPAGGLAEERRRGRNEWRIHDVTT 
APFPGLVQRRSRLLIVSQVRYFLKNKVSPDLC 
KEDGLTALHQCCIDNFEEIVKLLLSHGANVN 
AKDNBLWTPLHAAATCGHINLVKILVQYGA 

QGUQEKINEMRVAPEQQMIADIHCMIAAGQ 

DLDWIDAQGA1LLHIAGANGYLRAAELLLDH 

GVRVDVKDWDGWEPLHAAAFWGQMQMAE 

LLVSHGANVLNARTSMDEMPIDLC^EEFKVL 

LLELIOHKHDVftlKSQLRHKSSLSRRTSHRQA 

S/SVGKVVRRTQPVGTGPNL\YRKEYE/GEEAI 

LWQRSA\AEDQRTSTYNGDIRETVRTDQENKD 
WO 01/57188

PCT/US01/03800




PNPRLEK\PVLLSEFPTKIPRGELDMPVENGLR 

APVSAYQYALANGDVWKVHEVPDYSMAYG 

NPGVADATPPWSSYKEQSPQTLLELKRQRAA 

AKLLSHPFLSTHLOSSMARTGESSSEGKAPLl 

GGRTSPYSSNGTSVYYTVTSGDPPLLKFKAPl 

EEMEEKVHGCCRIS 


838 


2188 


A 


6865 


6291 


739 


AGPLEPRVQGAMALQLWALTLLGLLGAGAS 

UMPRKLDFERSEKELNHLAVDEASGVVYLGA 

VNALYQLDAKLQLEQQVATGPVLDNKKCTP 

PIEASQCMEAHvnDNVNQLLLVDPPRKRLVE 

CGQLLKGIVCALRALSNISLRLFYEDGSGEKSF 

VASNDEGVATVGLVSSTGPGGDRVLFVGKG 

NGPHDNGUVSTRLLDRTDSREAFEAYTDHAT 

YKAGYLSTNTQQFVAAFED<a>YVFFVFNQQD 

KHPARNRTLLARMCREDFNYYSYLEMDLQC 

RDPDIHAAAFGTCLAASVAAPGSGRVLYAVF 

SRDSRSSGGPGAGLCLFPLDEVHAKMEANKN 

ACYTGTREARDIFYKPFHGDIQCGGHAPGSSK 

SFPCGSEHLPYPLGSRDGLRGTAVLQRGGLN 

LTAVTVAAENNHTVAFLOTSDGRILKVYLTP 

DGTSSBYDSILVEINKRVKRDLVLSGDLGSLY 

AMTQDKVFRLPVQECLSYPTCTQCRDSQDPY 

CGWC:WEGRCrrRKAEa»RAEEASHWLWSRS 

KSCVAVTSAQPQNMSRRAQGEVQLTVSPLPA 

LSBEDELLCLFGESPPHPARVEGEAVICNSPSS 

IPVTPPGQDHVAVnQLLLRRGNIFLTSYQYPF 

YDCRQAMSLEENLPCISCVSNRWTCQWDLR 

YHECREASPNPEDGIVRAHMEDSCPQFLGPSP 

LVIPNENHETDVNFQGKNLDTVKGSSLHVGSD 

IJLKFMEPVTNIQESGTFAFRTPKLSHDANETL 

PUILYVKSYGKNIDSKLHVTLYDCSFGRSDC 

SLCRAANPDYRCAWCGGQSRCVYEALCNTT 

SECPPPVmUQPETGPLGGGnUTILOSNLOVQ 

AGDIQRISVAGRNCSFQPERYSVSTRTVCVIEA 

AETPFTGGVEVDVFGKLGRSPPNVQFTFQQP 

m.SVEPQQGPQAGGTTLTIHGTHLDTGSQED 

VRVTLNGVPCKVTKFGAQLQCVTGPQATRG 

QMLLEVSYGGSPVPNPGIFFTYRENPVLRAFE 

PLRSFASGORSINVTOQGFSLIQRFAMWIAEP 

LQSWQPPREAESLQPMTWGTDYVFHNDTK 

WFLSPAVPEEPEAYMLTVLIEMDGHRAIXRT 

EAGAFEYVPDPTFENFTGGVKKQVNKLIRAR 

GTNLNKAMTLQBAEAFVGAERCTMKrLTET 

DLYCEPPEVQPPPBCRRQKRDTTHNLPEFIVKF 

OSREWVLCHIVEYDTRVSDVPLSLILPLVIVPM 

VWIAVSVYCYWRKSQQAEREYEKIKSQLEG 

LEESVRDRCKKEFTOLMIEMEDQTrNDVHEAG 

ffVLDYKTYTDRVFFLPSKDGDKDVMITGKL 

DEPEPRRPWBQALYQFSNLLNSKSFLINFIHT 

LVENQPEFSARAKVYFASIiTVALHGKLEYYT 

DIMHTLFLELLEQYWAKNPKLMLRRSETW 

ERNILSNWNISICLYQYLKDSAGEPLYKLFKAI 

KHQVEKGPVDAVQKKAKYTLNDTGLLGDD 

VBYAPLTVSVIVQDEGVDAIPVKVLNCDTISQ 

VKEKDDQVYRGQPCSCWPRPDSWLEWRPG 

STAQn.SDLDLTSQREGRWKRVNTLMHYNVR 

DGATULSKVGVSQQPEDSQQDLPGERHALL 

EEENRVWHLVRPTDEVDEGKSKRGSVKEKE 

RTKArrEIYLTRLLSVKGTLQQFVDNFFQSVL 

APGHAVPPAVKYFFDFLDEQAEKHNIQDEDTI 

HIWKTNSLPLRFWVNILKNPHFIFDVHVHEW 

DASLSVIAQTrmDACTRTEHKLSRDSPSNKLL 

YAKEISTYKKMVEDYYKGIRQMVQVSDQDM 

NTHLAEISRAHTDSLNTLVALHQLYQYTQKY 

YDEIINAUEEDPAAQKMQLAFRLQQIAAALE 

NKVTDL 


839 


2189 


A 


6872 


1 


1483 


RAKRLALQCHVCVCALTPOEQSGRRLPGQT 

WLMFSCFOFSLQDNSFSSTTVTTCDEDPVSLH 

EDQTDCSSLRDENNKENYPDAGALVEEHAPP 

SWEPQQQNVEATVLVDSVLRPSMGNFKSRKP 

KSIFKAESGRSHGESQETEHWSSQSECQVRA 

GTPAHESPQNNAFKCQETWRLXQPRIDQRTAT 

SPKDAFETR\QDLNEEEAAQVHGVKDPAP7^ 

TQSVLAXDGTDSADPSPVHKDGQNEADSAPE 

DLHSVGTSRLLl/YHrrDGDNPTAVRHGCSL/F 

SGQSQRFNLDPESAPSPPSTQQFMMPRSSSRC 

SCGDGKEPQTITQLTKHIQSLKRKIRKFEEKFE 

QEKKYRPSHGDKTSNPEVLKWMNDLAKGRK 

QLKELKLKLSEEQGSAPKGPPRNIXCEQFrVP 

RENGKPEAAGPEPSSSGEETPDAALTCLKERR 

EQLPPQEDSKVTKQDKNLKPLYDRYRIIKQIL 

STPSUPnVSQiyrCMLLLCTDV 


840 


2190 


A 


6873 


2 


2054 


FFRPYFSFIRLFAMSLADLTKTNlDHaTGVAL 

ENNRRSAACKRSPGTGDFSRNSNASNKSVDY 

SRSQCSCGSLSSQYDYSEDFLCDCSEKAINRN 

YLKQPWKEKEKKKYNVSKISQSKGQKEISV 

EKKHTWNASLFNSQIHMIAQRRDAMAHRILS 

ARLHKKGLKNELADMHHKLEAILTENQFLK 

QLQLRHLKAIGKYENSQN>flLrQIMAKHQNEV 

KNLRQLLRKSQEKERTLSRKLRETDSQLLKT 

KDILQALQKLSEDKNLAEREELTHKLSHrnC 

MDANDKKIQSLEKQLRLNCRAFSRQLAIETR 

KTLAAQTATXTLQVEVKHLQQKLKEKDREL 

EKMYSHRILKNLHDTEDYPKVSSTKSVQAD 

RKILPFTSMRHQGTQKSDVPPl/rnCGKKATG 

NroHKEKSTED«ffiIPHCVNKLPKQEDSKBKY 

EDLSGEEKHLEVQILLENTGRQKDKKHDQEK 

KNIFVKEEQELPPKIIEVIHPERESNQEDVLVR 

EKFKRSMQRNGVDDTVLCaCGTAPYTKGPLRQ 

RRHYSFTEATENLHHGLPASGGPANAGNMR 

YSHSTGKHLSNREEMELEHSVDSGYEPSFGKS 

SRKVKDTTFRDKKSSLMEELFGSGYVLKTD 

QSSPGVAKGSEEPLQSKESHPLPPSQASTSHA 

FGDSKVTWNSKPSSPTECaCRKni 


841 


2191 


A 


6874 


3 


2867 


SSRTREMEEKEILRRQIRLLQGLIDDYKTLHG 

NAPAPGTPAASGWQPPTYHSGRAFSARYPRP 

SRRGYSSHHOPSWRKKYSLVNRPPGPSDPPA 

DHAVRPLHGARGGQPPVPQQHVLHIQVQLS 

(JGQNVVDCVKPPSKSGSASASGAQRGSLEEFE 

DTPWSDQRPREGEGEPPRGQLQPSRPTRARG 

TCSVEDPLLVCQKEPGKPRMVKSVGSVGDSP 

REPRRTVSESVIAVKASFPSSALPPRTGVALG 

RKLGSHSVASCAPOLLGDRRVDAGHTDOPVP 

SGSVGGPARPASGPRQAREASLWTCRTNKF 

RKNNYKWVAASSKSPRVARRALSPRVAAEN 

VGKASAGMANKVEKPQLIADREPKPRKPATS 

SKPGSAPSKYKWKASSPSASSSSSFRWQSEAG 

SKDHASQLSPVLSRSPSODVRPALAHSGLKPLS 

GETPLSAYKVKTRTKIIRRRGSTSLPGDKKSG 

TSPAATAKSHLSLRKR.QALRGKSSPVLKKTPN 

KGLVQVTKHRU3RLPPSRAHLPTKEASSLHA 

VRTAPTSKVnCTRYRTVKKTPASPLSAPPFPLS 

LPSWRARRLSLSRSLVLNRLRPVASGGGKAQ 

PGSPWWRSKGYRCIGGVLYKVSANKLSKTSG 

QPSDAGSRPLLRTGRLDPAGSCSRSLASRAVQ 

RSLAIIRQARQRREKKKBYCMYYNRFGRCm. 

GERCPYIHDPEKVAVCTRFVRGTCKKTDGTC 

PFSHHVSKEKMPVCSYFLKGICSNSNCPYSHV 

YVSRKAEVCSDFLKGYCPLGAKCKKKHTLLC 

PDFARROACPRGAQCQLLHRTQKRHSRKAAT 

.SPAPGPSDATARSRVSASHGPRKPSASQRPTR 

QTPSSAALTAAAVAAPPHCPGGSASPSSSKAS 

SSSSSSSSPPASLDHE\APSLQEAA1AAACSNR 

LCKLPSnSLQSSPSPGAQPRVRAPRAPLTKDS 

GKPLHIKPRL 


842 


2192 


A 


6898 


506 


2071 


WPDLVHTWSSEEAMGSCCSCPDKDTVPDNH 

KNKFKVINVDDDGNELGSGIMELTDTELILYT 

RKRDSVKWHYLCLRRYGYDSNLFSFESGRRC 

QTGQGIFAFKCARAEELFNMLQEIMQNNSIN 

WEEPWERNNHQTELEVPRTPRTPTTPGFAA 

QNLPNOYPRYPSFGDASSHPSSRHPSVGSARL 

PSVGEESTHPLLVAEEQVHTYVNTTGVQEER 

KNRTSVHVPLEARVSNAESSTPKEEPSSIEDR 

DPQILLEPEGVKFVLGPTPVQKQLMEKEKLE 

QLGRDQVSGSGANNTEWDTGYDSDBRKDAP 

SVNKLVYENINGLSIPSASGVRRGRLTSTSTSD 

TQNINNSAQRRTALLNYH^PSLPPVWEARK 

LSRDEDDNLGPKTPSLNGYHNNLDPMHNYV 

NTENVTVPASAHKIEYSRRRDCTPTVFNFDIR 

RPSLEHRQLNYIQVDLEGGSDSDNPQTPKTPT 

TPLPQTPTRRTELYAVIDIERTAAMSNLQKAL 

PRDDGTSR\KTRHNST\DLPL 


843 


2193 


A 


6919 


2 


663 


AORPGTTHASOKMAYQSLRLEYLQIPPVSRA 

YTTACVLTTAAVQLELITPFQLYFNPHJFKHF 

QIWIU-ITNFLFFGPVGFNFLFNMIFLYRYCRM 

LEEGSFRGRTADFVFMFLFGGFLMTLFGLFVS 

LATLGPGLYNN/GSSMCGAEVEPLCPHELLRP 

SQLPGPLSALGAHGIFLWOELNHCGPFGYCS 

WTHIFFLGRCISQSTWWNKNSENTIYFESYF 


844 


2194 


A 


6928 


902 


366 


HRLCMPIQGACGERME/FSLLLPGLECNGVIL 
AHCNLRLPGSSNSPASASQVAGITGVCHHAR 
LIFVFSVETGFLHAGQAGLELLTSGDPPASAS 
QSAGITGKSQHTRPGYEFnPYSAAQEDALKA 
LM 


845


2195 


A 


6939 


1660 


317 


LYPENLGESLFPILLLH>PWPDGGRPCCVEMS

TRAKKLRRIWRILEEKESVAGAVQTLLLRSQE

GGV\TSAAASTLSEPPRRTQESR'RATRALGLPT

LPMEKLAASTEPQGPRPVLGRESVQVPDDQD

FRSFRSECEAEVQWNLTYSRAGVSVWVQAV

EMDRTLHKKCRMECCDVPAETLYDVLHDIE

YRKKWDSNVIETFDIARLTVNADVGYYSWR

CPBCPLKNRDVTTLRSWLPMGADYIIMNYSVK

HPKYPPRKDLVRAVSIQTGYLIQSTGPKSCVTT

YLAQVDPKGSLPKWWNKSSQFLAPKAMKK

MYKACLKYPEWKQKHLVPHFKPWLVHPEQSP

LPSLALS\ELSVQHADS\LBNIDESAV\AESREE

■R\MGGAGGEG\SDDDTSLYAEAPHRFRETETG

PGAGRALGAAAAPALSPLHPPGTWWHRARP

RRVLQPGWTEPQ 


846 


2196 


A 


6944 


42 


2672 


RRKMAGCRGSLCCX:CRWCCX;CGERETRTPB 

ELTTLGETQEEEDEILPRKDYESLDYDRCINDP 

YLEVLETMDNKKGRRYEAVKWMWFAIGV 

CTGLVGLFVDFFVRLFTQLKFGVVQTSVEECS 

QKQCLALSLLELLGFNLTFVFLESLLGLIEPVE 

AGSGITEGKCYLYARQVPGLVRLPTLLWKAL 

GVLLTVAAMLUVGLGSPMIHSGSWGAGLPQ 

FQSISLRKIQFNFPYFRSDRYGK\DKRDFVSAG 

AAAGVAAAFGAPIGGTLFSLEEGSSFWNQGL 

TWKVLFCSMSATPTLNFFRSOIQFOSWOSFQL 

PGLLNFGEFKCSDSDKKCHLWTAMDLOFFV 

VMGVIGGIXGATFNCaJ>IKRLAKYRMRNVHP 

KPKLVRVLESLLVSLVTTWVFVASMVLGEC 

RQMSSSSQIGNDSFQLQVTEDVNSSKTFFCP 

NDTYNDMATLFFNPQESAILQLFHQDGTFSPV 

TLALFFVLYFLLACWTYOISVPSQLFVPSLLC 

GAAFGRLVANVLKSYIGLOHIYSGTFALIGAA 

AFUjGVVRMTISLTVILIESTVNErrYGLPIMVT 

LMVGKWTGDFFNKGIWDIHVGLRGVPLLEW 

ts 1 ii V cMlJIUUKAbUirahrfiL# 1 1 V YPrl ITUQSLV 

SILRTTVHHAFPVVTENRGNEKEFMKGNQLIS 

NNKFKKSSILTRAGEQRKRSQSMKSYPSSEL 

RNMCDBHIASEEPAEKEDLLQQMLERRYTFY 

PNLYPDQSPSEDWTMEERFRPLTFHGLDLRSQ 

LVTU,VRGVCYSESQSSASQPRLSYAEMAH3 

YPRYPDIHDLDLTLLNPRMIVDVIPYMNPSPF 

TVSPNTHVSQVFNLFRTMGLRHLPWNAVGE 

lYGirrRHNLTYEFLQARLRQHYQTl 


847 


2197 


A 


6951 


3 


1994 


NTNSSSVTNSAAOVBDLNIVQVTVPDNEKER 

LSSIEKKQLREQVNDLFSRKFGEAIGVDFFVK 

VFYRXriTNPGCVVlDGMPPGVVFKAPGYLEI 

SSMRRJLEAAEFKFTVIRPLPGLBLSNGEYST 

VGKRKIDQEGRVFQEKWERAYFFVEVQNIST 

OJCKRSMSVSKEyNLRRHYQTNHSKHYDQY 

MERMRDEKLHELKKOLRKYLLGLSDTECyE 

QKQVFANPSFTQKSPVQPVEDLAGNLWEKLR 

EKIRSFVAYSIAIDEITDINNTTQLAIFIRGVDE 

NFDVSEELLDTVPMTGTKSGNEIFSRVEKSLK 

NFCINWSKLVSVASTGTPPMVDANNGLVTKL 

KSRVATFCaCGAELKSICCIIHPESLCAQVKLKM 

DHVMDVWKSVNWTrSRni NHSRFTTT I VPI 
DSQYGSLLYYTEIKWLSRGL\aJKRFFESLEEI 
DSEMSSRGKPLPQLSSEDWIRDLAFLVDMTM 
HLNALNISLOGHSOrVTOMYDLTRAFT AKin 

WETHLTRNNLAHFPTLKLVSRNESDGLNYIP 

KIAELKTEFQKRLSDFKLYESELTLFSSPFSTKI 

DSVHHELQMEVIDLQCNTVLKTXYDKVGIPE 

FYKYLWGSYPKYKHHCAKILSMFGSTYICEQ 

LFSIMKLSKTKYCSQLKDSQWDSVLHIAT 


848 


2198 


A 


6985 


3 


289 


SVQYLPGRPTRTHASTDAPLNfiLKFTPLPSKTK 
ASAPVQCLLLMAATFSPQGLAKPHSGTIPmC 
CFNAINTKIPIQRLESYTRrrNIQCPKEAVM 


849 


2199 


A 


6999 


963 


5 


LDFLCHRDMGDNrrSITEFLLXGFPVGPRIQM 

UJ^IJSLFYVFrLLGNGTILGLISLDSRIJIAP 

MYFFL^HL\AVVDIAYACNTVPRMLVNLLHP 

AKPISFAGRMMQIHJ'STFAVTECLLLVVMS 

YDLYVVAICOTLRYUUMTWRVCITLAVTSWT 

TGVLLSLIHLVUJJ»liTaRPQKIYHFFCEILA 

VLKLACADTHINENMVLAGAISGLVGPLSTIV 

VSYMOLCAILQIQSREVQRKAFCrCFSHLCVI 

GUFYGTAIIMYVGPRYGNPKEQKK.YLLLFHS 
LFNPMLNPUCSLRNSEVKNTLKRVLGVERAL 


850 


2200 


A 


7001 


1 


1011 


MGNDSVSYEYGDYSDLSDRPVDCLDGACLAI 

DPLRVAPLPLYAAIFLVGVPGNAMVAWVAG 

KVARRRVGATWLLHLAVADLLCCLSUILAV 

PIARGOHWPYGAVGCKALPSnLLTMYASVIi 

LAALSADLCFLAIXa'AWVCLRFS/GACXJVQVA 

CGAAWTLALLLTVPSAIYRRLHQEHFPARLQ 

CWDYGGSSSTENAVTAIRFLFGFLGPLVAVA 

SCHSALLCWAARRCRPLGTAIWGFFVCWAP 

YHLLGLVLTVAAFNSALLARALRAEPLIVGL 

ALAHSCLNPMLFLYFGRAQLRRSIPAACHW 

ALRESQGQDESVDSKKSTSHDLVSEMEV 


851 


2201 


A 


7011 


V 


2310 


AAASPLRMSRKGPRAEVCADCSAPDPGWASI 

SRGVLVCDECCSVHRSLGRHISIVKHLRHSA 

WPPTXLQMVHn.ASNGANSIWEHSLLDPAQV 

QSGPALKQTPKDKVXHPKSEFIRAKYQMLAF 

VHKLPOIDDDGVTAKDLSKQLHSSVRTGNLE 

KXRLLSLGAQANFFHPEKGTTPLHVAAKAG 

QTLQAELLWYGADPGSPDVNGRTPIDYARQ 

AGHHELABRLVECXJYH.TDRLAFYLCGRKPD 

HKNGHYDPQMADSLDLSELAKAAKKKLQAL 

SNRLFEELAMDVYDEVDRREM5AVWLATQN 

HSTLVTERSAVPFLPVNPEYSATRNQGRQKL 

ARFNAREFATLmJILSEAKRRQQGKSLSSPTD 

NLELSLRSQSDLDDQHDYDSVASDEDTDQEP 

LRSTGATRSNRARSMDSSDLSDGAVTLQEYL 

ELKKALATSEAKVQQLMKVNSSLSDELRRLQ 

REIHKLQAENLQLRQPPGPVPTPPLPSERAEH 

TmAPGGSTHRRDRQAFSMYEPGSALKPFGG 

PPGDELTTRLQPFHSTELH3DAIYSVHVPAGL 

YRISKGVSASAVPFTPSSPLLSCSQEGSRHTSK 

LSRHOSGADSDYKn-QSraDPLLGLEGKHFLE 

LGKEEDFHPELESUDGDLDPGLPSTEDVlUCr 

EQVTXNIQELLRAAQEFKHDSFVPCSEKIHLA 

VTEMASLFPKRPALEPVRSSLRLLNASAYRLQ 

SECRKTVPPEPGAPVDFQLLTQQVIQCAYDIA 

KAAKQLVrrrTREKKQ 


852 


2202 


A 


7016 


484 


1777 


RISKIQVYYSTOYSSRKMNPTLGLAIFLAVLL 

TVKGLLKPSFSPRNYKALSEVQGWKQRMAA 

KELARQNMDLGFKLLKKLAFYNPGRNIFLSP 

LSISTAFSMLCLGAQDSTUDEIKQGFNFRKMP 

EKDLHEGFHYIIHm.TQKTQDUvLSIGNTLFID 

QRLQPQRKFLEDAKNFYSAEmTNFQNLEM 

AQKQINDFI/ESKTHGKINNLIENrDPQTVMLL 

ANYXFFRARWKHEFDPNVTKEEDFFLEKNSS 

VKVPMMFRSGIYQVGYDDKLSCTILEIPYQK 

NlTAIFILPDEfflCLKHLEKGLQA/DTFSR'WKTL 

LSRRWDVSVPRLHMTGTFDLKKTLSYIGVS 

KJFEEHGDLTKIAPHRSLKVGEAVNKAELXM 

DHIOTEOAAGTGAQTLPMETPLWKIDKPYL 

LLIYSEKIPSVLFLGKIVNHGK 


853 


2203 


A 


7017 


1 


3293 


MIHACNPSTLGGQGRRrraSHGRRRSSRGPV 

ARHVAAGAGHENKHGGSRRFPAGVAPRRAM 

ANVSKKVSWSGRDRDDEEAAPLLRRTARPG 

GGTPLLNGAGPOAARQSHISALFRVGHMSSV 

ELDDELLEPVDMDPPHPFPKEIPHNEKLLSLKY 

ESLDYDNSENQLFLEEERKlNHrAFRTVEIKR 

WVICALIGILTGLVACFIDIWENLAGLKYRVI 

KGSILPNIDKFTEKGGLSFSLLLWATLNAAFV 

LVGSVIVAFIBPVAAGSGIPQIKCFLNGVKIPH 

WRLKTLVIKVSGVILSWGGLAVGKEGPMI 

HSGSVIAAGISQGRSTSLKRDFKIFEYFRRDTIE 

KRDFVSAGAAAGVSAAFGAPVGGVLFSLEEG 

ASFWNQFLTWRIFFASMISTFTLNFVLSrYHG 

NMWDLSSPGLINFORFD^KMAYTIHEIPVn 

AMGWGGVLGAVFNALNYWLTMFRIRYIHR 

PCLQVIEAVLVAAVTATVAFVLIYSSRDCQPL 

QGGSMSYM-QLFCADGEYNSMAAAFFNTPEK 

SWSLFHDPPGSYNH.TLGLFTLVYFFLACWT 

YGLTVSAOVFIPSLLIGAAWGRLFGISLSYLTG 

AAIWADPGKYALMGAAAQLOGIVRMTLSLT 

VIMMEATSNVTYGFPIMLVLMTAKIVGDVFIE 

GLYDMHIQLQSVPELHWEAFVTSHSLTAREV 

MSTPVTCURRREKVGVIVDVLSDTASNHNGF 

PWEHADDTQPARLQGLILRSQLIVLLKHKVF 

VERSNLGLVQRRUaKDFRDAYPRFPPIQSIH 

VSQDERECTMDLSEFMNPSPYTVPQEASLFR 

VFKLFRALGLJRHLVVVDNRNQVVGLVTRKD 

LARYRLGKRGLEELSIAQTGPKAQATAEGRV 

LSLEELSERYESSHPTSTASVPEQDTAKHWNQ 

LEQWWELQAEVACXEHIKQRCERATRSLL 

RELLQVRARVQLQGSELRQLQQEARPAAQAP 

EKEAPEFSGLQNQMQALDKRLVEVREALTRL 

RRRQVQQEAERRGAEQEAGLRLAKLTDLLQ 

QEEQGKEVACGALQKNQEDSSRRVDLEVAR 

M 


854 


2204 


A 


7037 


139 


2604 


AGTWEPRPYDQAKETGAPGSQPPVPPMELRP 

WLLWWAATOTLVLLAADAQGQKVFTNTW 

AVRIPGGPAVANSVARKHGFLNLGQIFGDYY 

HFWHRGVTXRSLSPHRPRHSRLQREPQVQWL 

EQQVAKRRTKRDVYQEProPKFPQQWYL\SG 

VTQNRDLMVKAAWAQGYTGHGIWSILDDGI 

EKNHPDLAGNYDPGASFDVNDQDPDPQPRY 

TQMNDNRHGTRCAGEVAAVANNGVCXJVGV 

AYNAMGGVRMLDGEVTDAVEARSLGLNPN 

HUDYSASWGPEDDGKTVDGPARLAEEAFFR 

GVSQGRGGLGSIFVWASGNGGREHDSCNCD 

GYTNSIYTLSISSATQFGNVPWYSEACSSTLA 

TTYSSGNQNEKQIVTTDLRQKCrESHrGTSAS 

APLAAGUALTLEANKNLTWRDMQHLWQTS 

KPAHLNANDWATNGVGRKVSHSYGYGLLD 

AGAMVALAQNWTTVAPQRKCODILIBPKDI 

GKRIJEVRKrVnrACIXJEPNHITRLEHAQARLT 

LSYNRRGDLAIHLVSPMGTRSTLLAARPHDY 

SADGFNDWAFMTTHSWDEDPSOEWVLEIEN 

TSEANNYGTLTKFTLVLYGTAPEGLPVPPESS 

GCKTLTSSQACWCEEGFSLHQKSCVQHCPP 

GFAPQVLDTHYSTENDVETERASVCAPCHAS 

CATCQGPALTOCLSCPSHASLDPVEQTCSRQS 

QSSRESPPQQQPPRLPPEVEAGQRLRAGLLPS 

HLPEWAOl^CAFTVLVFVTVELVLQLRSGFS 

FRGVKVYTMDRGLISYKGLPPEAWQEECPSD 

SEEDEGRGERTAFKDQSAL 


855 


2205 


A 


7058 


3 


1441 


QRPASQLLAPFAAEALPGAPRAAMAQHFSLA 

AaJWOFDLDHTLCRYNLPESAPLrYNSFAQF 

LVKEKGYDKELLNVTPEDWDFCCKGLALDL 

EDQNFUO-AhWOTVLRASHGTKMMTTEVLA 

EAYGKKEWKHFLSDTGMACRSGKYYFYDN 

YFDLPGALLCARVVDYLTKLNNGQK'1>DFW 

KDIVAAIQHNYKMSAFKENCXJIYFPEIKRDPG 

RVLHSRPESVKXWLRQLKNAGKILLLITSSHS 

DYOtt-LCANmONDFTDLFDIVITNALKPOFF 

SHLPSQRPFRTLENDEEQEALPSLDKPGWYSQ 

GNAVHLYEIXKKMTGKPEPKWYFGDSMHS 

DIFPARHYSNWETVLILEELRGDEGraSQRPE 

ESEPLEKKGKYEGPKAKPLNTSSKKWGSFFM 

DSVLGLENTEDSLVYTWSCKRISTYSTIAIPSr 

EAIAELPLDYKFrRFSSSNSKTAGYYP^a>PLV 

LSSDETUSK 


856 


2206 


A 


7082 


396 


1635 


SSPSVFEFEHAVQPVKI'MEKLICTCVLRRNACT 

AVCFWRSKWQKPSVRRISTTSPRSTVMPAW 

VIDKYGKNEVLRFTQNMMMPIIHYPNEVIVK 

VHAASVNPIDVNMRSGYGATALNMKRDPLH 

VKIKGEEFPLIXGRDVSGWMECGLDVKYFK 

PGDEVWAAVPPWKQGTLSEFVWSGNEVSH 

KPKSLTHTQAASLPYVALTAWSAINKVGGLN 

DKNCTGKRVLILGASGGVGTFAIQVMKAWD 

AHVTAVCSQDASELVRKLGADDVIDYKSGSV 

EEQLKSLKPFDFIU3NV0GSTETWAPDFLKK 

WSGATYVTLVTPFLLNMDRLGIADGMLQTG 

VTVGSKALKHFWKGVHYRWAEFMASGPCL 

DDIAELVDAGKIRPVUEQTFPFSKVPEAFLKV 

ERGHARGKTVINW 


857 


2207 


A 


7088 


320 


2417 


UUU^MTPQSLLQTTLFIXSLLFLVQGAHOR 

OHREDFIU'CSQRNQTHRSSIJHryKPTPDLRISIE 

NSEEALTVHAPFPAAHPASRSFPDPRGLYHFC 

LYWNRHAGRLHLLYGKRDFLLSDKASSLLCF 

QHQEESLAQGPPLLATSVTSWWSPQNBLPSA 

ASFTFSFHSPPHTGAHNASVDMCELKRDLQL 

LSQFLKHPQKASRRPSAAPASQQLQSLESKLT 

SVRFMODMGSFEEDRINATVWKLQPTAOLQ 

DLHfflSRQEEEQSEXMEYSVLLPRTLFQRTKG 

RSGEAEKRUXVDFSSQAUFQDKNSSQVLGE 

KVLGIWQNTKVANLTCPWLTFQHQLQPKN 

VTLQCVFWVEDPTLSSPGHWSSAGCErrVRRE 

TQTSCFCNHLTYFAVLMVSSVEVDAVHKHY 

LSLLSYVGCWSALACLVTIAAYLCSRVPLPC 

RRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPV 

ALTGSEAGCRASADFLHFSLLTCLSWMGLEG 

YNLYRLWEVFGTYVPGYLLKLSAMGWGFPI 

FLVTLVALVDVDNYGPnLAVHR'IPEGVIYPS 

MCWIRDSLVSYITNLGLFSLVFLFNMAMLAT 

MWQILRLRPHTQKWSHVLTLLCLSLVLO\LP 

WALBFFSFASGTFQLWLYLFSirrSFQGFLIFI 

WYWSMRLQARGGPSPLKSNSDSARLPISSGS 

TSSSRI 


858 


2208 


A 


7091 


185 


415 


DAGAVKSSDTNIWFRGMCDDKKGHRCPS*G 

QPQHFHVAFHTEAEOAMFYFRLHVIHRVMQS 

QQQLFPSTLFSWLLE 


859 


2209 


A 


7136 


3 


302 


FFFWRQSLALLPRLECSGATGAHCNLHFPGSS 
DCPTSAS*IAGrrGACYHAWLLFVFLAETGFH 
HVGQGGLELLTSSDPSGSASQSAGITGVSHCT 
WPI 


860 


2210 


A 


7156 


23 


591 


ALSTETRTPDMRRUXVTSLVVVIXWEAGAV 
PAPKVPnCMQVKHWPSEQDPEKAWGARWB 
PPaaJDQLWLFPVQKPKLLTTEEKPRGQGR 
GPELPGTKAWMETEDTLGRVLSPEPDHDSLY 

HPPPEEDQGEERPRLWVMPNHQVLLGPEEDQ 
DHTYHPQ^GSRGHHCPRPVPRPRLLGLGPSLP 
CPS 


861 


2211 


A 


7161 


1220 


1003 


NYVCnAF*EKKMGF*LSLSCLVLLFVLFLDa 

LTirnUNIFHCrYLFASVCLSLLNTLLSPNCL 

KSAMILQ 


862 


2212 


A 


7211 


665 


847 


UfLY YHITMGIYKTGKKVIL'KSSMSNRFSVIF 
YKNIQKLSFSNYVYHQNYVFSSDWSYDF 


863 


2213 


A 


7212 


924 


1273 


HGSSCALGDLAPG*LPSGPVLSSPAVRL*RKP 
LVWDSPSCLPATGPT*GLVLVLGGPDCT»WA 
RGQHEHKRMRAP*SCRVTVNLAKKKKKTDQ 
CKPNYQSPPKECDYNILANSVA 


864 


2214 


A 


7214 


845 


1619 


SDKGGKKADRKNHLRHAFPLLPHRVRERLH 
DPKVPVDADHVQGQDPGRAAHDIHGEDVTE 
KVSKDPLAHDEVGDTDEGHDRHGHREVGQR 
HGHDQEEVAYEERACEGOKFATVEVTDKPV 
DEALREAMPKVAKYAGGTNDKGIGMGMTV 
PISFAVFPNEDGSLQKKLKVWFRIPNQFQSDP 
PAPSDKSVKIEEREGrrVYSMQFGGYAKEAD 
YVAQATRLRAALEGTATYRGDIYFCTGYDPP 
MKPYORRNEIWLLKT 


865


2215 


A 


7246 


559 


682 


RRLGAVAHAYTSSTLGGRGGWrr»GQELQTS 
LANMAKPRLY 


866 


2216 


A 


7257 


641 


1310 


TCTYKYLMGWIR.ORRSRHSWEMSEFHNYNL 

DLKKSDFSTRWQKQRCPWKSKCRQJASPFF 

FCCFIAVAMQIRFnMVAIWSAVFLNSLFNQEV 

QIPLTESYCGPCPKNWICYKNNCYQFFDESFCN 

WYESQASCMSQNASLLKVYSKEDQDIXKLV 

KSYHWMGLVKPTNGSWQWEDGSILSPNLLT 

nEMQKGDCALYASSFKGYENCSTPNTYICM 

QRTV 


867 


2217 


A 


7288 


151 


396 


SIKIIEAFGSNGPDFWFFRYWSP*LFRQQVVF1 
MPFFQTL\WLJvlN/U«U'CSIFTTTNVANNCWW 
TPYHCWLSVWCRCESHGl 


868 


2218 


A 


7298 


3 


272 


PDTVIGGRGSGGKEFGRWVLW*VFE*RLGTP 
KGSCPAGGSRMVSESD*EGRGC*ASYPCAC* 
A0S*WR*OSRPA<aiGTPPRSLSHARPP 


869 


2219 


A 


7332 


1223 


332


PRRDAEDRDESCLNPAFPIGLLHPNSVNSMAR 

FLTLCrWLLLLGPGLLATVRAECSQDCATCS 

YRLVRPADINFLACVMECEGKLPSLKIWErC 

KELLQLSKPELPQDGTSTLRENSKPEESHLLA 

KRYGGFMKRYGGFMKKMDELYPMEPBEEA 

NOSEBLAKRYGGFMKKDAEEDDSLANSSDLL 

KELLETGDNRERSHHQDGSDNEEEVSKRYGG 

FMRGLKRSPQLKBKAKELQKRYGGFMRRVG 

PQKW»MTSa>QNRYGGFLKRFAEALPSDEE<aB 

SYSKEVPEMEKRYGGFMRF 


870 


2220 


A 


7382 


216 


1018 


EIHQRLTERTQFLDESRKNFNS*QANLLRGGG 

AGQGRGREOAESGGSRGEGPGSDGRLPATGD 

FWSPRSQRRGCCGRRAPRPEAMENQAVYSPT 

TEEDPGPARGPRSGLAAYFFMGRLPLLRRVL 

KOl^QiXtfoXXAriCiis V VoQC 1 LCGuLYFrEr 

VSCSAFlXSIin-IVYCnTYERVDTrKVKSSD 

FraiXrrOCVFIXASnFVSIHDRTSAEIAArVF 

GFIASFMFLLDFnMLYEKRQESQLRKPENTT 

RAEALTEPLNA 


871 


2221 


A 


7403 


3 


393 


SCAMCSGLL'UJLPIWLSWTLGTRGSEPRSVN 
DPGNMSFVKETVDKLLTOFRCFREREAAPRR 
AIROAAIPOESEAODPESLRSSVNADWIQYS 

DLWEAEVSTPRCEAGFCQECFRTPQNQEKDG 
PFIC 


872


2222


A




lUDl 


351


FVDIVSWEFPHCPEARFPAQHGQDSKRLTLC 

PGGS*PQATLHLDRMRVSASPTFaBIQVKKYK 

CoLlKPCrANYrAFKlCSGAANVVGFrMCFED 

RMIMSPVKNNVQRGLNIALVNOTTOAVLOQ 

KAFDMYSGDVMHLVKFLKEIPGC3ALAa-VAS 

YDDPGTKMNDESRKLFSDLGSSYAKQLCTRD 

SWVFIGAKDLRGKSPFEQFUCEQPQTQNKYE 

GWPELLEMEGCMPPKPF 


873


2223

A 


7429 


2242 


2394 


ILKCAGHGGSCL*SQHFGRLRWEDRIJILGVQ 
DHPGQHCETPSLLKIERKLF 


874 


2224 


A 


7468 


146 


894 


PCTSCVLWATLHU>ASTRKAPQAEC(»<IISrrE 
WQKIGVGrrOFOIFFILFGTLLYFDSVIXAFGN 
LIJLTGLSLIIGLRKTFWFFFQRHKLKGTSFLL 
GGWIVLLRWPUXJMFLETYGFFSLFKGFFPV 
AFQF1XjNVCK[H=1XiALFRR1.QOTSSMV*KTE 
MSSLNLDHWLKGAKREEWEPPPQSPALTHSP 
TYPGPPQVQKERNGAEQLTSNPQVDSRGCQE 
AEMQTPRRLGWGWYHTLTLYLWEEK 


875 


2225


A 


7498 


91 


251 


GEKPVPTWLQDEAGQWLLGFVAQPWGWPG 
SERHEP»HGGVLFRLGPSAPPOa. 


876 


2226 


A 


7544 


403 


387 


YSCIXTLFKHrrSFKNSVHIWLGTVVHAYNPN 
ILGGQGGWIA'GQEFKTSLGNTVRPCLYK 


877 


2227 


A 


7566 


2 


940 


GCAPDTRFFVPEPGGRGAAPWVALVARGGC 

TFKDKVLVAAKRNASAWLYNEERYGNnXP 

MSHAGTfflCWIMISYPKGREILELVQKGIPV 

TMnGVGTRHVQEFISGQSVVFVAIAFITMMU 

SLAWLIFYYIQRFLYTGSQIGSQSHRKETKKVI 

GQLLLHTVKHGEKGIDVDABNCAVCIENFKV 

KDIIRILPCKHIFHRICIDPWLLDHRTCPMCKL 

DVKALGYWGEPGDVQEMPAPESPPGRDPAA 

NLSLALPDDDGSDESSITSASPAESEPQCDPSF 

KGDAGENTAIXEAGRSDSRHGGPIS 


878 


2228 


A 


7586 


315 


1232 


ERSLLCKVDVRWIYVSEGTKTQRRHRQGSLR 

RGRMQAACWYVLELLQPTVYLVTCANLTNG 

GKSELI^JCSGSSICSIUCHIWTESSKDLSISRLLS 

QTFRGKENDTDLDLRYDTPEPYSEQDLWDW 

LRNSTDLQEPRPRAKRRPIVKTGKFKKMFGW 

GDFHSNKTVKLNLUTGKIVDHGNGTFSVYF 

RHNSTGQGNVSVSLVPPTBaVEFDLAQQTVID 

AKDSJCSFNCRIEYEKVDKATKNTLCNyDPSK 

TCYQEQTQSHVSWLCSKPFKVICIYISFYSTD 

YKLVQKVCPDYNYHSDTPYFPSO 


879


2229

A 


iOUj 




■aoi 
dVI 


X oa WJtLK W Warl CJJXJLNGSAPGNVFIHG 


880 


2230 


A 


7612 


93 


659 


DAAVAMTAQGGLVANRGRRFKWAIELSGPG 

GGSRGRSDRGSGQGDSLYPVGYLDKQVPDTS 

VQETDRILVEKRCWDIALOPLKQIPMNLFIMY 

MAGKnSIFPIMMVCMMAWRPIQALMAISAT 

FKMLESSSQKFLQGLVYLIGNLMGLALAVYK 

CXJSMGLLPTHASDWLAFIEPPERMEFSGGGL 

LL 


881 


2231 


A 


7615 


291 


1452 


SPQKTMRSHTITMTTTSVSSWPYSSHRMRFIT 

NHSDQPPQNFSATTNVTTCPMDEKLLSTVLTr 

SYSVIFIVGLVGNIIALYVFLGIHRKRNSIQIYL 

UWOADlJXIFCLPFRIMYHINQNKWTLGVn- 

CKWGTIJFYMNMYISIILLGFISLDRYIKINRSI 

QQRKAirrKQSIYVCXIIVWMLALGGFLTMlIL 

TLKKGGHNSTMCFHYRDKHNAKGEAIFNFIL 

VVMFWUFIXin^YlKJGKMLLRlSKRRSKFPN 
SGKYATTARNSFIVLIIFTICFVPYHAFRFIYISS 
QLNVSSCYWKEIVHKTNEIMLVLSSFNSCLDP 
VMYFLMSSNIRKIMCQLLFRRFQGEPSRSEST 
SEFKPGYSLHDTSVAVKIQSSSKST 


882 


2232 


A 


7617 


67 


379 


RQMALLKANKDLISAGLKEFSVLLNQQVFND 
PLVSEEDMVTWEDWMNFYINYYRQQVTGE 
PQERDKALQELRQELNTLANPFLAKYRDFLK 
SHELPSHPPPSS 


883 


2233 


A 


7622 


400 


213 


KVKTCRYNPKYSAANDTGFVDIPSREKDLAK 
AVATVGPISVAVGASHVFFQFYKKGKHLSS 


884 


2234 


A 


7638 


2640 


2861 


APVLILQKfVKLSrVLTPQFLSHDQGQLTKELQ 
QHVKSVTCPCEYLRKVSECRQMGPGALEQFP 
GLSCHTSHSG 


885


2235 


A 


7642 


201 


455 


PSRGKMELEAMSRYTSFVNPAVFPHLTWLL 
AIGKOTnrAWFFVYEVrSTKYTRDIYKELUSL 
VASLFMGFGVLFLLLWVGIYV 


886 


2236 


A 


7692 


61 


569 


APENPFSRQHFNSETKVKLSLKTGTWLGNHA 

HLGEHFSTHHELGLSGKVVGFLVKNILEViKN 

GGMETRHPOKVSSWFHRWDSRAEQHNHAE 

HHEDVPQGDEDSKVSEAQQEFPDWTCAGLP 

GIXPKALRVLLFQLKVQHRPGIHQQRPEQQD 

VSDHRYGRSVRQNRK 


887 


2237 


A 


7693 


85 


315 


NPGCCLPVAMRTSYLLUTLCLLLSEMASGG 

NFLTGLGHRSDHYNCVSSGGQCLYSACyiFTK 

IQGTCYRGKAKCCK 


888 


2238 


A 


7702 


242 


1298 


AFSHRRRYLSPSRSAGQLGNMALERLCSVLK 

VLLrrVLWEGIAVAQKTQDGQNIGKHIPAT 

QCGIWVRTSNGGHFASPNYPDSYPPNKECIYl 

LEAAPRQRIELTFDEHYYIS'SFBCRFDHLEVR 

DGPFOFSPUDRYCGVKSPPLIRSTORFMWIKF 

SSDEELEOLGFRAKYSFIPDPDFTYLGGILNPIP 

DCQFELSGADGIVRSSQVEQEEKTKPGQAVD 

CrVITKATPKAKIYlJRFLDYQMEHSNECKRNF 

VAVYDGSSSDENLKABtFCSTVANDVMLKTGI 

GVIRMWADEGSRLNRFRMLFTSFGGASPAQA 

ALSFCHSNMCINNSLVCNOVQNCAYPWDEN 

HC 


889 


2239 


A 


7707 


185 


2911 


CHYIMNPSTHHPASAGGSILGLFDFFGLGLGE 

MTMDALLABLKLLNPDDLREEIVKAGLKCGP 

rrSrniFIFEKKLAQALLEQGGRLSSFYHHEA 

GVTALSQDPQRILKPAEGNITOQAGFSEDRDF 

GYSVGLNPPEEEAVTSKTCSVPPSDTOTYRAG 

ATASKEPPLYYGVCPVYEDVPAKNERIYVYE 

NKKEALQAVKMDCGSRFKAFSTREDAEKFAR 

GICDYFPSPSKTSLPLSPVKTAHJ'SNDRLKDG 

LCLSESETVNKERANSYKNPRTQDLTAKLRK 

AVEKGEEDTFSDUWSNPRYUOSGDNPTIVQ 

EGCRYNVMHVAAKENQASICQLTLDVLENP 

DFMRLMYPDDDEAMLQKRIRYWDLYLNTP 

DKMGYDTPLHFACKFGNADWNVLSSHHU 

VKN SKNKYDKTPEDVICERSKI^KS VEIJCERIR 

EYLKGHYYVPLLRAEETSSPVIGELWSPDQTA 

EASHVSRYOGSPRDPVLTLRAFAOPLSPAKAE 

DFRKLWKTPPREKAGFLHHVKKSDPERGFER 

VGRELAHELGYPWVEYWEFLGCFVDLSSQE 

GLQRUffiYLTQQEIGKKAQQETGEREASCRD 

KATTSGSNSISVRAFLDEDDMSLEEKNRQNA 

ARNNSPPTVGAFGHTRCSAFPLEQEADLIEAA 

EPGGPHSSRNGLCHPLNHSRTLAGKRPKAPR 

OEEAHU»PVSDLTVEFDKLNLQNIGRSVSKTP 

DESTKTKDQILTSRINAVERDLLEPSPADQLG 

NGHRRTESEMSARIAKMSLSPSSPRHEDQLEV 

TOEPAIUILFIJFGEEPSKLDQDVLAALECADV 

DPHQFPAVHRWKSAVLCYSFSDRQSWPSFAV 

KGRFKSQLPDLSGPHSYSPGRNSVAGSNPAKP 

GLGSPGRYSPVHGSQLRRMARLAELAAL 


890 


2240 


A 


7711 


360 


269 


RHMPVEPALWEAEVGGLLEPRSSRSAWATE 


891 


2241 


A 


7721 


61 


1175 


KLPWEPSFUKMQIIRHSEQTLKTALISKNPVL 

VSQYEKUDAGEQRLMNEAFQPASDLFGPITL 

HSPSDWrrSHPEAPQDFEQFFSDPYRKTPSPN 

KRSIYIQSIGSLGNTRnSEEYIKWLTGYCKAYF 

YGLRVKLLEPVPVSVTRCSFRVNENTHNLQIH 

AGDnJCFLKKKKPEDAFCWGITMIDLYPRDS 

WNFVFGQASLTDGVGIFSFARYGSDFYSMHY 

KGKVKKLKKTSSSDYSIFDNYYIPEITSVLLLR 

SCKTLTHEIGHIFGLRHCXJWLACLMQGSNHL 

EEADRRPLNLCPICLHKLQCAVGFSrVERYKA 

LVRWIDDESSDTPGATPHHSHEDNGNLPKPV 

EAFKEWBCEWmCCLAVLQK 


892 


2242 


A 


7723 


2 


1650 


SAPTAPARPCRAERGSGGGMLALLAASVALA 

VAAGAQDSPAPGSRFVCTAUPPEAVHAGCPL 

PAMPMCJGGAQSPEEELRAAVLQLRETWQQ 

KETLASARAIRELTGKLARCEGLAGGKARGA 

GATGKDTMGDLPRDPGHWEQLSRSLQ'ILK 

DRLESLEPLPAMPMQGOAQSPEEELRAAVLQ 

LRETWQQKETLASARAIRELTGKLARCEGL 

AGGKARGAGATGKDTMGDLPRDPGHWEQ 

LSRSLQTLKDRLESLEHQLRANVSNAGLPGD 

FREVLQQRLGELERQLLRKGAELEDEKSLLH 

NETSAHRQKTESTLNALLQRVTELERGNSAF 

KSPNAFKVSLPLRTNYLYGKIKKTLPELYArr 

ICLWLRSSASPGMGTPFSYAVPGQANEIVLIE 

WGNNPIELLINDKVAQLPLFVSDGKWHHICV 

ITrrXRDGMWEAFQDGKKLGTGENLAPWHPI 

KPGGVLILGQEQDTVGGRFDATQAFVGELSQ 

FNIWDRVLRAQETVNIANCSTNMPGNIIPWVD 

NNVDVFGGASKWFVETC3EERLLDL 


893 


2243 


A 


7729 


3554 


2419 


LTAGTAMNYPLTLEKIDLENLEDLFWELDRL 

DNYNDTSLVENHLCPATEGPLMASFKAVFVP 

VAYSUFLLGVIGNVLVLVILERHRQTRSSTET 

FLFHLAVADLLLVFILPFAVAEGSVGWVLGTF 

LCKTVIALHKVNFYCSSLLLACIAVDRYLAIV 

HAVHAYRHRRlXSIHlTCQTIWLVOFLLAJJ»EI 

LFAKVSQGHHNNSLPRCTFSQENQAETHAWF 

TSRFLYHVAGFLLPMLVMGWCYVGWHRLR

QAQRRPQRQKAVRVAILVTSIFFLCWSPYHIV 

IFLDTLARLKAVDNTCKLNGSLPVATTMCEFL 

GLAHCCLNPMLYTFAGVKFRSDLSRLLTKLG 

CTGPASLCQLFPSWRRSSLSESENATSLTTF 






A 
n 


/ OO 


07U 


287 


FVTRAGRWGAGARVRGGAGGMASGAARWL 

VLAPVRSGALRSGPSLRKDGDVSAAWSGSGR 

SLVPSRSVIVTRSGAILPKPVKMSFGLLRVFSI 

VffFLYVGTLISKNFAAJUUEEHDIFVPEDDDDD 

D 


895


2245 


A 


7753 


119 


278 


APYAHSQVHCLDKVCGLLPFUNPEVPDQFYR 
LWLSLFLHAGKEAPHCPRTRPL 


896 


2246 


A 


7754 


1 


372 


SPAWWNSQQRWSPFLALLTLEPTFHHLLPIM 

QVSTAAI^VLLCTMALCNQVLSAPLAADTPT 
ACCFSYTSRQIPQNFIADYFETSSQCSKPSVIFL 
TKRGRQVCADPSEEWVQKYVSDLELSA 


897 


2247 


A 


7761 


1725 


445 


RPRRRGTHHFSCVLGSFRVSAMFPRVSTFLPL 

RPLSRHPLSSGSPETSAAAIMLLTVRHGTVRY 

RSSALLARTKNNIQRYFGTNSVICSKKDKQSV 

RTEETSKETSESQDSEKEimCKDLLGIIKGMK 

VELSTVNVRTTKPPKRRPLKSlJEA"n.GRLRRA 

TEYAPKKREEPLSreLVAAASAVADSLPFDKQ 

TTKSELLSQLQQHEEESRAQRDAKRPKISFSNI 

ISDMKVARSATARVRSRPELRIQFDEOYDNYP 

GQEODDIJKKRKNnT-GKBlJflFDMMAVTKE 

AEETDTSPSLWDVEFAKQLATVNEQPLQNGF 

EELIQWTKEGKLWEFPINNEAGFDDDGSEFH 

HnFLEKHLESFPKQGPIRHFMELVTCGLSKNP 

YLSVKQKVEHIEWFKNYFNEKKDILKESNIQF 

KLRPWKFLPRNN 


898 


2248 


A 


7775


85 


496 


SCQTTQPPAQSCSTGIMRIMLLFTAILAFSLA 

QSFGAVCKEPQEEWPGGGRSKRDPDLYQLL 

QRLFKSHSSLEGLLKALSQASTDPKESTSPEK 

RDMHDFFVGLMGKRSVQPDSPTDVNQENVP 

SFGILKYPPRAE 


899 


2249 


A 


7785 


179 


703 


PFHLGASSNTFRLQVQTQESKAQKEVKMGFI 

FSKSMNESMKNQKEFMLMNARLQLERQLIM 

QSEMRERQMAMQIAWSREFLKYFGTFFGLA 

AISLTAGAIKKKKPAFLVPIVPLSFILTYQYDL 

GYGTLLERMKGEAEDILETEKSKLQLPRGNDT 

FESDEKARKEQSRFFIDK 


900 


2250 


A 


7789 


1465 


300 


VWLPLKSYKIRSPSLHCQCEIFREEFLFSSLQE 

GRDKDTFSKMAMVSEFLKQAWFIENEEQEY 

VQTVKSSKGGPGSAVSPYPTFNPSSDVAALH 

KAIMVKGVDEATIIDIL'nCRNNAQRQQIKAAY 

LQETGKPLDETliCKALTGHLEEWLALLKTP 

AQFDADELRAAMKGLGTDEDTLIEILASRTN 

KEIRDINRVYREELKRDLAKDITSDTSGDFRN 

ALLSLAKGDRSEDFGVNEDLADSDARALYEA 

GERRKGTDVNVFNTILTTRSYPQLRRVFQKY 

TKYSKHDMNKVLDLELKGDIEKCLTAIVKCA 

TSKPAFFAEKLHQAMKGVGTRHKALIRIMVS 

RSEroMNDKAFYQKMYGISLCQAILDETKGD 

YEKILVALCGGN 


901 


2251


A 


7796 


2 


807 


VEFHPQRARAGARAPSMGVLLTQRTLLSLYL 

ALLFPSMASMAAIGSCSKEYRVLLGQLQKQT 

DLMQDTSRLLDPYnUQGLDVPKLREHCRERP 

GAFPSEETLRGLGRRCFLQTLNA'n-GCVLHRL 

ADLEQRLPKAQDLERSGLNIEDLEKLQMARP 

NELGLKNNIYCMAQLLDNSDTAEPTKAGRGA 

SQPPTPTPASDAFQRKLEGCRFLHGYHRFMH 

SVGRVFSKWGESPNRSRRHSPHQALRKGVRR 

TRPSRKGKRLMTRGQLPR 


902 


2252 


A 


7802 


2 


721 


TAARRRQKGTAARRLQKGTAARRRQKGTAA 
RRRQKGTAARRPQKGTAARRRQKGTAARRR 
QKGTAARRRQKGTAARRPQKGTAARRRQKG
TAARRRQKOTAARRRQKGLAIASRGCPCASR 
AGGVRGAOSRLRAMAPKVERQYWDIPDGTD 
CHRKAYSTTSIASVAOLTAAAYRVTLNPPGTF 
LEOVAKVGQYTFrAAAVGAVFGLTTCISAHV 
REKPDDPLNYFLGGCAGGLTLG/^THNYGIG 
AAACVYFGIAASLVKMGRLEGWEVFAKPKV 


903 


2253 


A 


7807 


1 


584 


PWLPWSDQRAARSSRKCPRSRFPYQVGKMA 

VSTVFSTSSUvlLALSRHSLLSPLLSVTSFRRFY 

RGDSPTDSQKDMIEIPLPPWQERTDESIErKR 

ARLLYESRKRGMLENOLLSLFAKEHLQHKfr 

EKQLNLYDRLINEPSNDWDIYYWATBAKPAP 

EIFENEVMALLRDFAKNKNKBQRLRAPDLEY 

LFEBCPR 


904 


2254 


A 


7813 


40 


821 


GAGRALGHLETGAGDVAAALPARKFPRSLLG 

AGAW.TGWTMNVFRILGDLSHLLAMILLLGK 

IWRSKCX3CGISGKSQILFALVFITRYLDLFTNF 

ISIYNTVMKYVnJXAYVTVYMFV'GKFRK'IF 

DSE^roTFRlJEFLLVl^GLSFLENYSFTLLEIL 

WTFSm,ESVAIU»QLEMISKTGEAETmHYL 

FFLGLYRALYLANWIRRYQ1TOFYDQIAWS 

GWQTIFYCDFFYLYVTKGRSWDDSNADTGL 

RSYSSI 


905


2255 


A 


7817 


1399 


881 


LSNKDVLSPQUajENSKLRKKLNEVQSFSEA 

QTTEMVRTLERKLEAKMKEESDYHDLESVVQ 

QVEQNLELMTKRAVKAENHWKLKQEISLL 

QAQVSNFQRENEALRCGQGASLTWKQNAD 

VALQNLRWMNSAQASIEQLVSGAETLNLVA 

EILKSIDRISEVKDEEEDS 


906 


2256 


A 


7822 


3 


1462 


DSPRNRFEILGRPTRTPTRPGPRPAMEDLDAL 

LSDLETTTSHMPRSGAPKERPAEPLTPPPSYG 

HQPQTGSGESSGASGDKDHLYSTVCKPRSPK 

PAAPAAPPFSSSSGVLGTGLCELDRLLQELNA 

TQFNTTOEIMSQFPSSKVASGEQKEDQSEDKK 

RPSLPSSPSPOI^KASATSATLELDRLMASLSD 

FRVQNHLPASGPTQPPWSSTNEGSPSPPEPTG 

KGSLDTMLGLLQSDLSRRGVPTQAKGLCGSC 

NKPUGQWTAIGRAWHPBHFVCGGCSTAL 

GGSSFFEKDGAPFCPECYFERFSPRCGFCNQPI 

RHKMVTALGTHWHPEHFCCVSCGEPFGDEG 

FHEREGRPYCRRDFLQLFAPRCQGCQGPILDN 

YISALSALWHPDCFVCRECFAPFSGGSFFEHE 

GRPLCENHFHARRGSLCATCGLPVTGRCVSA 

LGRRFHPDHFTCTFCLRPLTKGSFQERAGKPY 

CQPCFLKLFG 


907 


2257 


A 


7828 


1792 


1671 


FIYVNQSFAPSPDQEVGTLYECFOSDGKLVLH 
YCKSQAWG 


908 


2258 


A 


7842 


110 


1172 



KLSCPCSHGTRVTAVRGPRLKAGVQWHDLG 

SLQPPPSGLKQSSHLSLSSSWDFRHAPTHPET 

YTa>KMIEMEQAEAQLAELDLLASMFPGENE 

LrVNDQLAVAELKDCIEKKTMEGRSSKVYrn 

NMNLDVSDEKMAMFSLACILPFKYPAVLPEI 

TVRSVLLSRSQQTQLNTDLTAFLQKHCHGDV 

CILNATEWVREHASGYVSRDTSSSPTTGSTVQ 

SVDLIFTRLWIYSHHIYNKCKRKNILEWAKEL 

SLSGFSMPGKPGWCVEGPQSACEEFWARLR 

KUWKHILIRHREDIPFDGTNDETERQRKFSIF 

EEKVFSVNOARGNHMDFGQLYQFLNTKGCG 

xj V F\^jyirj_f TV V 


909 


2259 


A 


7870 


3067 


2923 


EGICVYTOYVHMYTRTCMHTYPYMYMNSV 
LISSEILLIPSKYLFESK 


910 


2260 


A 


7884 


212 


4874 


GALTWSHPLIAVCPQOVWLGSTTSGSPALUP 
PSHRVNAEPGCWTNACASGPCPPHANCRDL 
WQTFSCTCQPGYYGPGCVDACLLNPCQNQG 
SCRHLPGAPHQYTCDCVGGYFGHHCEHRMD 
QQCPRGWWGSPTCGPCNCDVHKGFDPNC3>JK 

TNGQCHCKEFHYRPRGSDSCXPCDCYFVGST 

SRSCAPHSGQCPCRPGALGRQCNSCDSPFAEV 

TASGCRVLYDACPKSLRSGVWWPQTKFGVL 

ATVPCPRGALGLRGAGAAVRLCDEAQGWLE 

PDLFNCTSPAFRELSLLLDGLELNKTALDTME 

AKKLAQRLREVTGHTOHYFSQDVRVTARLL 

AHLLAFESHQQGFGLTATQDAHFNENLLWA 

GSALLAPETGDLWAALGQRAPGGSPGSAGLV 

RHLEEYAATLARNMELTYLNPMGLVTPNIML 

SIDRMEHPSSPRGABRYPRYHSNLFRGQDAW 

DPHTHVLLPSQSPRPSPSEVLPTSSSIENSTTSS 

WPPPAPPEPEPGISniLLVYRTLGGLLPAQFQ 

AERRGARU>QNPVMNSPVVSVAVFHGRNELR 

GILESPISUEFRLLQTANRSKAICVQWDPPGLA 

EQHGVWTARDCELVHRNGSHARCRCSRTOT 

FGVLMDASPRERLEGDLELLAVFTHWVAVS 

VAALVLTAAUXSLRSLKSNVRGIHANVAAA 

LGVAELLFLLGIHRTHNQLVCTAWILLHYFF 

LSTFAWLFVQGUiLYRMQVEFRNVDRGAMR 

FYHALGWGVPAVLLGLAVGLDPEGYGNPDF 

CWISVHEPUWSFAGPVVLVrVMNGTMFLLA 

ARTSCSTGQREAKKTSALTLRSSFLLLLLVSA 

SWLFGLLAVNHSILAFHYLHAaLCGLQOLAV 

LLLFCVLNADARAAWMPACLGRKAAPEEAR 

PAPGLGPGAYNNTALFEESGLIKnXGASTVSS 

VSSARSGRTQDQDSQRGRSYLKDNVLVRHGS 

AADHTDHSLQAHAGPTDLDVAMFHRDAGA 

DSDSDSDLSLEEERSLSIPSSESEDNGRTRGRF 

QRPLCRAAQSERLLTHPKDVDGNDIXSYWPA 

LGECEAAPCALQTWGSERRLGLDTSKDAAN 

NNQPDPALTSGDETSLGRAQRQRKGILKNRL 

QYPLVPQTRGAPELSWCRAATLGHRAVPAAS 

YGRIYAGGGTGSLSQPASRYSSREQLDLLLRR 

QLSRERIEEAPAPVLRPLSRPGSQECMDAAPG 

RLEPKDRGSTLPRRQPPRDYPOAXiAGRFGSR 

DALDLOAPREWLSTLPPPRRTRDLDPQPPPLP 

LSPQRQLSRDPUJPSRPLDSLSRSSNSREQLDQ 

VPSRHPSREALGPLPQLLRAREDSVSGPSHGP 

STEQLDILSSILASFNSSALSSVQSSSTPLGPHT 

TATPSATASVLGPSTPRSATSHSrSELSPDSEPR 

DTQALLSATQAMDLRRRDYHMERPLLNQEH 

LEELGRWOSAPRTHQWRTWLQCSRARAYAL 

LLQHLPVLVWLPRYPVRDWIXGDIXSGLSVA 

IMQLPQGLAYALLAGUPVFGLYSSFYPVFIY 

FLFGTSRHISVESLCVPGPVDT 


911 


2261 


A 


7890 


21 


806 


EFGTSRSSRSMAEDLGLSFGETASVHULPEHG 

SCJIPKARSSSARWALTCXXVLLPFLAGLTTYL 

LVSQLRAQGEACVQFQALKGQEFAPSHQQV 

YAPLRADGDKPRAHLTWRQTPTQHFKNQFP 

AUWEHEWLAFIKNRMNYTNKFLLEPESGD 

YFIYSQVTFRGMTSECSEIRQAGRPNKPDSrrV 

VmCVTDSYPEPTQLLMGTKSVCEVGSNWFQ 

rl 1 i^uAMr ol^(2<^tJJ\ij>M ViVI VaUioL VD Y I 

DKTFFGAFLL 


912 


2262 


A 


7891 


1263 


111 


ACGIRHEGALPGLTATPBAMLRFLPDLAFSFL

LILAIX3QAVQFQEYVFLQFLGLDKAPSPQKFQ 

PVPYHJCKIFQDREAAATTQVSRDLCYVKELG 

VRGfNVLRFLH)QOFFLYPKKlSQASSCLQKLL 

YFNLSADCEREQLTLAQLGLDLGPNSYYNLGP 

ELELALFLVQEPHVWGQTTPKPGKMFVLRSV 

PWPQOAVHFNLLDVAKDWNDNPRKNFGLFL 

EILVKEDRDSGVNFQPEDTCARLRCSLHASLL 

VAniJ«JFDQaiPSRKRRAAIPVPKLSCKNLCH 

RHQUTNFRDLGWHKWIIAPKGFMANYCWGB 

CPFSLTISLNSSNYAFMQALMHAVDPmPQAV 

OirnaSPISMLYQDNNDNVILRHYEDMVVD 

ECGCG 


913 


2263 


A 


7892 


15 


849 


AaRLPRGPGCGADMRPLLGLLLYFAGCTFAL 

YLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAE 

LRELSEVLREYRKEHQAYVFLLFCGAYLYKQ 

GFAIPGSSFLNVLAGALFGPWLGLLLCCVLTS 

VGATCCYLLSSIFGKQLWSYFPDKVALLQR 

KVEENRNSLFFHXFUUJTNITIWWFLNLSAPI 

LNmVQFFFSVUGLIPYNFICVQTGSILSTLTS 

IJJAU^WDTVFiajJUAMVALIPGTLIKKFSQ 

KHLQLNETSTANHIHSRKDT 


914 


2264 


A 


7893 


815


959 


KSGWVWWLTPLIPALWEAQTEOSLRPEVKN 
RLSNTTRPFFSKKKKILV 


915 


2265 


A 


7909 


3 


641 


HASGPGGLLRRRRGSGANMPVARSWVCRKT 

YVTPRRPFEKSRLDQELKLIGEYGLRNKREV 

WRVKFTLAKIRKAARELLTLDBKDPRRLFEG 

NALLRRLVRIGVLDEGKMKLOYILGLKIEDFL 

BRRLQTQVFKLOLAKSIHHAHVLIQQCHIRVR 

EQVVNIUTTVRLDSQKHIDFSLCFPIGVANPS 

HVKRKNASKGQGGAGARDDEEEE 


916 


2266 


A 


7914 


3 


967 


VAHTQWHTCQRLSQLTHRSILKYLLIDTHAC 

QVLILKHTHASLSLPSCXJECFPSSIPSASHMVS 

HPHPPPSPRWGQTPEGLPAASPCGPGPRSCFS 

SILPTGDSWGMLACLCTVLWHLPAVPALNRT 

GDPGPGPSIQKTYDLTRYLEHQLRSLAGTYLN 

YLGPPFNEPDFNPPRLGAETLPRATVDLEVW 

RSLNDKLRLTQNYEAYSHLLCYLRCa^IRQAA 

TAELRRSLAHFCTSLQGLLGSIAGVMAALGY 

PLPQPLPGTEPTWTPGPAHSDFLQKMDDFWL 

LKELQTWLWRSAKDFNRLKKKMQPPAAAVT 

LHLGAHGF 


917 


2267 


A 


7921 


2 


1166 


RPRRGQGLVQEVQTENVTVAEGGVAEITCRL 

HQYDGSIWIQfffARQTLFFNGTRALKDERFQ 

LEEFSPRRVRKLSDARLEDEGGYFCQLYTED 

THHQIATLTVLVAPH^WEVREQAVEGGEV 

ELSCLVPRSRPAATLRWYRDRKELKGVSSSQ 

ENGKVWSVASTVRFRVDRKDDGGDICEAQN 

QALPSGHSKQTQYVLDVQYSPTARIHASQAV 

VREGDTLVLTCAVTGNPRPNQIRWNRGNESL 

PERAEAVGETLTLPQLVSADNOTYTCEASNK 

HGHARALYVLWYGESRLRPTEGGOGAPDP 

GAVVEAQTSVPYAIVGGILAU.VFUICVLVG 

MVWCSVRQKGSYLTHEASGLDEQGEAREAF 

LNGSDGHKRKEEFFI 


918 


2268 


A 


7938 


3 


2653 


RRRLPPASPPSSSVSSSLSPSAWMACRWSTK
ESPRWRSALLXLFLAGVYGNGALAEHSENVH 

ioU Vol rl^ui!. 1 r IM^lKAJr ovjJJ, 1 OXAJ W X^oJb X j'/VIv 

INCSWFIRANPGEimSFQDFDIQGSRRCNLD 

WLTEETYKNlESYRACGSnPPPYISSQDHrWIR 

FHSDDNISRKGFRLAYFSGKSEEPNCACDQFR 

CGNGKClPEAWKCNNMDECGDRSDEaCAKE 

ANPPTAAAFQPCAYNQFQCLSRFTKVYTCLP 

ESLKOXJNIDCLDLGDEIDCDVPTCGQWIJCY 

FYGlTOSPNirPDFYPPGSNCTWLIDTGDHRK 

VILRFTOFKUXjTGYGDYVKIYDGLEENPHK 

LLRVLTAFDSHAPLTWSSSGQIRVHFCADKV 

NAARGFNATYQVDGFCXPWEIPCGGNWGCY 

TEQQRCDGYWHCPNGRDETNCmCQKffiFP 

CSRNGVCYPRSDRCNYQNHCPNGSDEKNCFF 

CX3IX3NFHCKNNRCVFESWVCDSQDDCGI)GS 

DEENCPVIVPTRVITAAVIGSLICGLLLVIALG 

CTOKLYSLRMFERRSFETQLSRVEAELLRREA 

PPSYGQLIAQGLIPPVEDFPVCSPNQASVLENL 

IUAVRSQLGFTSV]UJ>MAGRSSNIWNRIFNFA 

RSRHSGSLALVSADGDEWPSQSTSREPERNH 

THRSLFSVBSDDTDTENERRDMAGASGGVAA 

PLPQKVPPTTAVEATVGACASSSTQSTRGGH 

ADNGRDVTSVEPPSVSPARHQLTSALSRMTQ 

GLRWVRFTLGRSSSLSQNQSPLRQLDNGVSG 

REDDDDVEMLIPISDGSSDFDVNDCSRPLLDL 

ASDQGQGLRQPYNATNPGVRPSNRDGPCERC 

GIVHTAQIPDTCLEVTLKNETSDDEAiXLC 


919 


2269 


A 


7951 


1674 


1839 


WRVTCCPPARSITERTNAYDEEDCVEMVAS 
GOWNDVACinTMYFMCEFDKKNM 


920 


2270 


A 


7953 


47 


572 


GGRASWPEQAKEPRREGHTDKQQTEDVLAA 

GLRCLPHLPAICARRMSPAFRAMDVEPRAKG 

VLLEPFVHQVGGHSCVLRFNETTLCKPLVPRE 

HQFYETLPAEMRKFTPQYKGKSQLLEGLPHW 

RGDVRDRGHGRPWQPSLEPSLPPTLCFPSLSS 

FSSSWPSAQHLTPSVFNPW 


921 


2271 


A 


7957 


612 


812 


RSGRTWTGIGYSKALQSSNRNTKSLLQNEF 

MMVYSFRALSFKESTWATFQHGGEATKSRSL 

SSTQ 


922 


2272 


A 


7967 


1443 


1660 


ENTTEKWKEIWMCRGNKKSCCWTFIKDRHLT 
VSCCKSKSGEnXICIFCSNLVGEFFFGIRGFSN 
WELVKPN 


923 


2273 


A 


7981 


1 


3023 


GSAPRAATAMARARPPPPPSPPPGLLPLLPPLL 

UJIXUJ'AGCRAI^TOlDTKWVTSELAWr 

SHPESGWEEVSGYDEAMNPIRTYQVCNVRES 

SQNNWLRTGFIWRRDVQRVYVELKFTVRDC 

NSIPNIPGSCKETFNLFYYEADSDVASASSPFW 

MENPYVKVDTIAPDESFSRLDAGRVNTXVRS 

FGPLSKAGFYLAFQDQGACMSLISVRAFYKK 

CASTTAGFALFPETLTGAEPTSLVIAPGTCIPN 

AVEVSVPLKLYCNGDGEWMVPVGACTCATG 

HEPAAKESQC31PCPPGSYKAKQGEGPCLPCPP 

NSRTTSPAASICTCHNNFYRADSDSADSACTT 

VPSPPRGVISNVNETSLILBWSEPRDLGVRDD 

LLYNVICKKCHGAGGASACSRCDDNVEFVPR 

QLGLSEPRVHTSHLLAHTRYTFEVQAVNGVS 

GKSPLPPRYAAVNrrTNQAAPSBVPTLRLHSS 

SGSSLTLSWAPPERPNGVILDYEMKYFEKSEG 

lASTVTSQMNSVQLDGLRPDARYWQVRART 

VAGYGQYSRPAEFETTSEROSOAQQLQEQLP 

LrVGSATAGLVFVYAVWIATVCXRKQRHGS 

DSEYTEKLQQYIAPGMKVYIDPFTYEDFNEA 

VREFAKEIDVSCYKIEEVIGAGEFGEVCRGRL 

KQPGRREVFVAIKTLKVGYTERQRRDFLSEA 

SIMGQFDHPNIIRLEGVVTKSRPVMILTEFME 

N(l\LDSFLRLNDGQFrVIQLVOMLRGIAAGM 

KYLSEMNYVHRDLAARNILVNSNLVCKVSDF 

GLSRFLEDDPSDPTYTSSLGGKIPIRWTAPEAI 

AYRKFTSASDVWSYGIVMWEVMSYGERPY 

WDMSNQDVINAVEQDYRLPPPMDa»rALHQ 

LNODCWVRDRNLRPKFSQIVNTLDKLIKNAA 

SLKVIASAQSGMSQPLLDRTVPDYrrFITVGD 

WLDAIKMGRYKESFVSAGFASFDLVAQMTA 

EDLUUGVUAGHQKKILSSIQDMRLQMNQT 

LPVQV 


924 


2274 


A 


7985 


1 


503 


FRPRTKKATAMYLEHYLDSIENLPCEUJRNF 

QIAOELDQRTEDKKAETOILAAEYISTVKTLS 

PDQRVERLQKIQJNAYSKCKEYSDDKVQLAM 

QTYEMVDKHIRRLDADLARFEADLKDKMEO 

SDFESSGGROLKKQROQKEKROSRGRGRRTS 

EEDTPKKKKHKGO 


925 


2275 


A 


7994 


447 


589 


LPCSFCAQCMSSFERVWLQQSHFHNPRWNSR 
SPIRCYCQHWPHCVHC 


926 


2276 


A 


7996 


925 


582 


OPCXVCCniAIMLQCHSFyRKDVQVEHPKS 
LNPKYSQIENFLSADMALKRKCLLSISDLDFW 
IWDAQPVGJMQTLQNLKKIPNPGCFWSQAFQI 
RDTQPILPLGGRYYmRQ 


927 


2277 


A 


7998 


2 


353 


riorplnsrspnhslfvkapi takoatmift 
CLLLVTLALCCYQANAEFCPALVSELLDFFH 
S]ffLFKLSLAKFDAPPEAVAAKLGVkRCrDQ 
MSLQKRSLIAEVLVKILKKCSV 


928 


2278 


A 


8004 


130 


588 


LAPLRCQPGTRTQPRSHPAANDPSAAMSAAG 
ARGLRATYHRLLDKVELMLPEKLRPLYNHPA 
GPRTVFFWAPIMKWGLVCAGLADMARPAEK 
LSTAQSAVLMATOFIWSRYSLVnPKNWSLFA 
VNFFVOAAGASQLFRIWRYNQELKAKAHK 


929 


2279 


A 


8007 


2 


1016 


EFARHRVFIAAREMSLLRSLRVFLVARTGSYP 

AGSLLRQSPQPRHTFYAGPRLSASASSKELLM 

KLRRKTGYSFVNCKKAI ETrGGDI K OAFIWl 

HKEAQKEGWSKAAKLQGRKTKEGLIGLLQE 

QhTTTVLVEVNCETDFVSRNLKFQLLVQQVAL 

GTMMHCQTLKDQPSAYSKGFLNSSELSGLPA 

GPDREGSLKDQIjVL/UGKLGENMILKRAAWV 

KVPSGFYVGSYVHGAMQSPSUIKLVLGKYG 

ALVICETSEQKTNLEDVGRRLGQHWGMAPL 

SVGSLDDEPGGEAETKMLSQPYLLDPSrrLGQ 

YVQPQGVSVVDFVRFECX3EGEEAAETE 


930 


2280


A 


8008 


3 


1679 


NSRVWGPWTEPSAGSLRPMARKQNRNSKEL

GLVPLTDDTSHAGPPGPGRALLECDHLRSGV 

PGGRRRKDWSCSLLVASLAGAFGSSFLYGYN 

LSWNAPTPYIKAFYNESWERRHGRPIDPDTL 

TLLWSVTVSIFAIGGLVQTLIVKMOKVLORK 

HTXLANNGFAISAALLMACSLQAOAFEMLIV 

GRFIMGIDGGVALSVLPMYLSEISPKEIRGSLG 

QVTAlFiaGVFTGQLLGLPELLGKESTWPYLF 

GVIWPAWQLLSLPFLPDSPRYLLLEKHNEA 

RAVKAFQTFLGKADVSQEVEEVLAESRVQRS 

IRLVSVLELLRAPYVRWQWTVIVTMACYQL 

CGLNATWFYTNSIFGKAGIPPAKIPYVTLSTGG 

lETLAAVFSGLVDEHLGRRPLLIGGFGLMGLFF 

GTLTITLTLQDHAPWVPYLSIVGILAllASFCSG 

PGGDPFELTGEFFQQSQRPAAFIIAGTVNWLSN 

FAVGLLFPnQKSLDrYCFLVFATICrrGAIYL

YFVLPETKNRTYAEISQAFSKRNKAYPPEEKI 

DSAVTDGKINGRP 


931 


2281 


A 


8009 


861 


300 


AAGAWSAMPKAKGKTRRQKFGYSVNRKRL 
NRNARRKAAPRIECSHIRHAWDHAKSVRQNL 
AEMGLAVDPNRAVPLRKRKVKAMEVDIEER 
PCT/US01/03800




PKELVRKPYVLNDLEAEASLPEKKGNTLSRD 
LIDYVRYMVENHGEDYKAMARDEKNYYQD 
TPKQIRSKINVYKRPYPAEWQDFLDSLQKRK 
MEVE 


932 


2282 


A 


8011 


412 


1 


SNLCLGNSWRWRWAKSRHHCIPTVnnLSKRSG 

DIRGSHFSSPQRQRSQRVPGKETARVLRAGK 

QGRGQIPIPCPWPPPPPPPPPGSPGPGCRQFHQ 

SLEAKARHPASVREMRGKVKMRRALRRAPA 

STRASSRQPNPK 


933 


2283 


A 


8012 


147 


1077 


PPVPPASRSDMAQNLKDLAGRLPAGPRGMGT 

/UXLLLGAGAVAYGVRESVFTVEGGHRAIFF 

NRlGGVQQDTlLAEGLHFRIPWFQYPnYDlRA 

RPRKISSPTGSKDLQMVNISLRVLSRPNAQEL 

PSMYQRLGLDYEERVLPSIVNEVLKSWAKF 

NASQLITQRAQVSUJRRELTERAKDFSLILDD 

VAITELSFSREYTAAVEAKQVAQQEAQRAQF 

LVEKAKQEQRQKTVQAEGEAEAAKMLGEAL 

SKNPGYDCLRKIRAAQNISKTIATSQNRIYLTA 

DNLVLNLQDESFTRGSDSLIKGKK 


934 


2284 


A 


8023 


255


982 


SQFSLSQVLVDSAEEGSLAAAAELAAQKREQ 

RLRKFRELHLMRNEARKLNHQEWEEDKRL 

KLPANWEAKKARLEWELKEEHCKKECAARG 

EDYEKVKIXEISAEDAERWERKXKKKNPDLG 

FSDYAAAQLRQYHRLTKQIKPDMETYERLRE 

KHGEEFFPTSNSLLHGTHVPSTEEIDRMVIDLE 

KQIEKRDKYSRBRPYNDDADIDYINERNAKF 

NKKAERFYGKYTAEIKQNLHROTAV 


935 


2285 


A 


8027 


59 


310 


LVSSTVNLLTEKAPWNSLAWTVTSYVFLKFL 
QGGGTGSTGMRDSALTLLGIGPSHRHSLSIRL 
SQHSSPAPMYSQTFHn.VLG 


936 


2286 


A 


8032 


1 


639 


SGREO^MAKTYDYIJKLLUGDSGVGKTCVL 

FRFSEDAFNSTnsnGIDFKIRTIELDGKRIKLQ 

IWDTAGQERFRTITTAYYRGAMGIMLVYDrr 

NEKSFDNIRNWIKNIEEHASADVEKMILGNICC 

DVNDKRQVSKERGEKLALDYGIKFMETSAK 

ANINVENAFFTLARDKAKMDKKLEGNSPQG 

SNQGVKTTPDQQKRSSFFRCVLL 


937 


2287 


A 


8039 


393 


311 


EETIHSENSYILEKYIPISANLTLTIA 


938 


2288 


A 


8052 


675 


1334 


LHPAATSTAWIJIVPPGLSMALSWVLTVLSLL 

PLLEAQIPLCANLVPVPrrNATLDRrrGKWFYI 

ASAFRNEEYNKSVQEIQATFFYFTPNKTEDTIF 

LREYQTRQDQOYNTrYLNVQRENGTISRYV 

GOQEHFAHIJLnJUnKTYMLAFDVNDEKNW 

GLSVYADKPETTKEQLGEFYEALDCUUPKSD 

A^VYTDWKKDKCEPLEKQHEKERKQEEGES 


939 


2289 


A 


8055


12 


1039 


SSVAEFPERVQLSQPQNWNFSGAGGAWStDF 

AEQLKWSAELARLGESIMDGKQGGMDGSKP 

AOPRDFPGKLLSNPLMODAVSDWSPMHEAA 

IHGHQLSLRNLISQGWAVNirrADHVSPLHEA 

OXXjHLSCVKILLKHGAQVNGVTADWHTPL 

FNACVSGSWDCVNLLLQHQASVQPESDLASP 

WEAARRGHVECVNSLIAYQGNIDHKISHLGT 

PLYLACENQQRACVKBCLLESQADVNQQKOQ 

DSPLHAVARTASEELACLLMDFOADTQAKN 

AEGKRPVELVPPESPLAQLFLEREGPPSLMQL 

OORIRKCTGIQQHHKITKLVLPEDLKQFLXH 

L 


940 


2290 


A 


8058


2 


1203 


KVLSIREPAHSTARKASEPSQPSQPSQPGGHLI 
ARLRTNJDLHLFDYSEPGNFSDISWPCWSSDa 

VVDTVMCTNMPNKSVLLYTLSFIYIFIFVIGKII 

ANSVWWVNIQAKTTGYDTHCYn-NLAIADL 

WWLnPVWWSLVQHNQWPMGELTCKVra 

UFSINLFGSmFLTCMSVDRYLSITYFTNTPSS 

RKKMVRRWCILVWLLAFCVSLPDTYYLKT 

VTSASNNETYOlSFyPEHSIKEWUGMELVSV 

VLGFAVPFSDAVFYFLLARAISASSDQEKHSS 

WUIFSYVVVFLVCWLPYHVAVLLDIFSILHYI 

PFTCRLEHAUTALHVTQCLSLVHCCVNPVL 

YSFINWfmYELMKAFEFKYSAKTGLTKLIDA 

SRVSETEYSALEQSTK 


941 


2291 


A 


8059 


73 


432 


DMAGUliITrvrrSLLFLGVCAHHIlPTGSVVLPS 
PCCMFFVSKRffiENRWSYQLSSRSTCLKAGV 
IFTTKKGQQFCGDPKQEWVQRYMKNLDAKQ 
KKASPRARAVAVKGPVQRYPGNQTTC 


942 


2292 


A 


8067 


278 


1262 


GGIGEIKQRPSCLGRCLDPSLSVLMNISLGLGS 

VFSAVISQKPSRDICQRGTSLTIQCQVDSQVT 

MMFWYRQQPGQSLTLIATANQJGSEATYESGF 

VIDKFPISRPNLTFSTLTVSNMSPEDSSIYLCSA 

GRQGTYEQYFGPGTRLTVTEDLKNVFPPEVA 

VFEPSEABISHTQKATLVCXATOFYFDHVBLS 

WWVNGKEVHSGVSTDPQPLKEQPALNDSRY 

CLSSRLRVSATFWQNPRNHFRCXJVQFYGLSE 

NDEWTQDRAKPVTQIVSAEAWGRADCGFTS 

ESYQQGVLSATILYEILLGKATLYAVLVSALV 

LMAMVKRKDSRG 


943 


2293 


A 


8070 


1 


879 


MVKWPATRGNLPRSQLTGTHQHCQPREPKI 

TASERLRRRPRATARLRAHAAPPEPPLAVFAP 

PSDRKELLALPVACDPVIASVMSWVQAASU 

QGPGDKGDVFDEEADESLLAQREWQSNMQR 

RVKEGYRDGIDAGKAVTLQQGFNQGYKKGA 

EVILNYGRLRGTLSALLSWCHLHNNNSTLINK 

INNLLOAVOQCEEYVLKHLKSITPPSHWDLL 

DSlEDMDLarVVPAEKKJDEAKDERLCENNA 

EFNKNCSKSHSGIDCSYVECCRTQEHAHSGK 

PKPHKOJFGTDSQF 


944 


2294 


A 


8073 


1


797 


ESARWSRQLRRTLIRLSFPISCtiRSHAFGGCIC 

MAATSGTOEPVSGELVSVAHALSLPAESYQN 

DPDIEMAWAMRAMQHAEVYYKLISSVDPQF 

UaXKVDDQIYSEFRKNFETLRIDVLDPEELK 

SESAKEKWRPFCUCFNGIVEDFNYGTLLRLD 

CSQGYTEENTIFAPRIQFFAIEIARNREGYNKA 

VYISVQDKEGEKGVNNGGEKRADSGEEENT 

KNGGEKGADSGEEKEEGINREDKTDKGGEK 

OKEADKEINKSGJaCAM 


945 


2295 


A 


8074 


2 


505 


GAATLLRSASSAARKAAEAEQVWLHLHRYL 

SADRRVLGLREWGRPASERECSLCQRLKREL 

NMGDVEKGKKIFIMKCSQCHTVEKGGKHKT 

GPNLHGLFGRKTGQAPGYSYTAANKNKOnW 

GEDnAffiYLENKCKYIPOTKMIFVGIKKKEER 

ADLIAYLKKATNE 


946 


2296 


A 


8081 


42 




x^inwD nwanvi noin 'xrwtsvhJCA'oQiiXjtTYin v 
cA3tsj\\JCLr\jijf^\^riri^r xriloiNaAiSaKMJLI VJUr 

VAIFAVPLILGQEYBDEERLGEDBYYQWYY 

YTVTPSYDDFSADFTIDYSIFESEDRLNRLDK 

DITE/UETTlSLETARADHPKPVTVKPVTEa>Q 

SPRSEAMPCPVLRSPIPLPPVRVPLFRWGCISC 

KKVGRRLLMTLWMGVWQEEIOR 


947 


2297 


A 


8084 


322 


549 


GGGSSPRELAGAAGLTVTSQAVAARRQQPSF 
SRARAPAHSLRAALSLASSARSWGAVSRDRG 
WO 01/57188




PCPPAIMYQSSNKC 


948 


2298 


B 


8093 


3905 


846 


MEPGEVKDRILENISLSVKKLQSYFAACEDEl 

PAIPNHDKVLQRLCEHLDHALLYGLQDLSSG 

YWVLWHFTRREAIKQIEVLQHVATNLGRSR 

AWLYLALNENSLESYLRLFQENLGLLHKYYV 

KNALVCSHDHLTLFLTLVSGLEFIRFELDLDA 

PYLDLAPYMPDYYKPQYLLDFEDRLPSSVHG 

SDSLSLNSFNSVTSTNLEWDDSAIAPSSEDYD 

FGDVFPAVPSVPSTDWEDGDLTDTVSGPRST 

ASDLTSSKASTRSPTQRQNPFNEEPAETVSSS 

DTTPVHTTSQEKEEAQALDPPDACTELEVIRV 

TKKKKIGKKKKSRSDEEASPLHPACSQKKCA 

KQGDGDSRNGSPSLGRDSPDTMLASPQEEGE 

GPSSTTESSERSEPGLLBPEMKDTSMERLGQPL 

SKVIDQLNGQLDPSTWCSRAEPPDQSFRTGSP 

GDAPERPPLCDFSEGLSAPMDrYRFTVESPST 

VTSGGGHHDPAGLOQPLHVPSSPEAAOQEEE 

GGGGEGQTPRPLEDTTREAQH^QLSLVRE 

GFVSEPEPGTQEVLCQUCRDQPSPCLSSAEDS 

GVDEGQJGSPSEMVHSSEFRVDNNHLLLLMIH 

VFRENEEQLFKMIRMSTGHMEGNLQLLYVLL 

TDCYVYLLRKGATEKPYLVEEAVSYNELDY 

VSVGUDQQTVKLVCTNRRKQFLLDTADVAL 

AEFFLASLKSAMDCGCREPPYPSILTDATMEK 

LALAKFVAQESKCEASAVTVRFYGLVHWED 

T3TT\T7CT ^im>r'ljr*ODDI?/^'ITFVI3/^X>fl OW A /^T 

r 1 \jc,ii\Ajr i V\^tvJaVrc\3 111 lvcijlWl>rt x KAij 1 

SYLGKEHWKTCFWLSNGILYQYPDRTDVIP 

LLSVNMGGEQCX3GCRRANTTDRPHAFQVILS 

DPPCLELSAESEAEMAEWMQHLCQAVSKGVI 

PQGVAPSPCIPCCLVLTDDRLFTCHEDCQTSF 

FRSLGTAKLGDISAVSTEPGKEYCVLEFSQDS 

QQLLPPWVIYLSCTSELDRLLSALNSGWK'nY 

QVDLPHTAIQEASNKKKFEDALSUHSAWQR 

SDSLCaiGRASRDPWC* 


949 


2299 


A 


8095 


9 


2374 


ARRADTVLLESPSMLQOLLPVSLLLSVAVSAI 

KELPGVKKYEWYPIRLHPLHKREAKEPEQQ 

EQFETELKYKMTINGKIAVLYLKKNKNLLAP 

GYTEIYYNSTGKXrrrSPQIMDDCYYQGHILN 

EBCVSDASBTCRGLRGYFSQGDQRYFIEFLSPI 

HRDGQEHALFKYNPDEKNYDSTCGMDGVL 

WAHDLQQNIALPATKLVKLKDRKVQEHEKY 

lEYYLVLDNGEFKRYNENQDEIRKRVFEMAN 

YVNMLYKKLNTHVALVGMErWTDKDKDOT 

PNASFIUaJFSKWRGSVLSRRKRHDUQUTA 

TELAGTTVGLAFMSTMCSPYSVGWQDHSD 

NLLRVAGTMAHBMGHNFGMFHDDYSCKCPS 

TICVMDKALSFYIPTDFSSCSRLSYDKFFEDKL 

SNCLFNAPLPTDDSTPlCGNQLVHvlGEDCDC 

GTSEECTNICXaDAKTCKJKATFQCALGECCEK 

CQFKKAGMVCRPAKDECDLPEMCNGKSGNC 

PDDRFQVNGFPCHHGKGHCLMGTCPTLQEQ 

CTELWQPGTEVADKSCYNRNEGOSKYGYai 

RVDDTLIPCKANDTMCGKLFCQGGSDNLPW 

KGRTVTFLTCKTFDPEDTSQEIGMVANGTKCG 

DNKVCINAECVDIEKAYKSTNCSSKCKGHAV 

CDHELQCQCEEGWIPPDCDDSSVVFHFSrWG 

VLFPMAVIFVWAMVIRHQSSREKQKKDQRP 

LSTTGTRPHKQKRKPQMVKAVQPQEMSQMK 

PHVYDLPVEGNEPPASFHKDTNALPPTVFKD 

NPMSTPKDSNPKA 


950 


2300 


A 


8100 


1 


1251 


MGLLLMILASAVLOSFLTLLAQFFLLYRRQPE 

PPADEAARAGEGFRYDCPVPGUJLREYLYGG 

GRDEEPSGAAPEGGATPTAAPETPAPPTRETC 

YFLNATIUlfREUU)TALTRRWVTKKIKVEF 

EELLQTKTAGRLLEOLSLRDVFLGETVPFIKTI 

RLVRPWPSATOEPDGPEGEALPAACPEELAF 

EAEVEYNGGFHLAIDVDLVFGKSAYLFVK1,S 

RWGRLRLVFTRA^PFTHWFFSFVEDPLIDFEV 

RSQFEGWMPQLTSirVNQUaaiKKKHTLPNY 

KERFKPFFPYQTLQGFEEDEBHiraQQWALTE 

GMJCVTLLECSafOXIFGSYDREANVHCTLELS 

SSVWEEKQRSSKTGTISLTAVFMGWHRVSE 

AITGLWYHXVDLPFWGLEDGGPLLTVPLRQ 

CPG 


951 


2301 


A 


8108 


1612 


839 


EVALFCFEMAAGMYLEHYLDSIENLPFELQR 

NFQLMRDLDQRTEDLKAEEDKLATEYMSSAR 

SLSSEEKLALLKQIQEAYGKCKEFGDDKVQL 

AMQTYEMVDKHIRRLiyrDLARFEADLKEKQI 

ESSDYDSSSSKGKKKGRTQKEKKAARARSKG 

KNSDEEAPKTAQKKLKLVRTSPEYGMPSVTF 

GSVHPSDVLDMPVDPNBPTYOLCHQVSYGE 

MIGCDNPDCSIEWFHFACVGLTTKPRGKWFC 

PRCSQERKKK 


952 


2302 


A 


8112 


595 


291 


PSVASLARRFSGRALWPPSHSVPGNRALCPRL 
LHOTTLPGGNQRELARQKNMKKQSDSVKGK 
RRDDGLSAAARKQRDSTPRDSEIMQQKQKK 
ANEKKEEPK 


953 


2303 


A 


8118 


1 


669 


VCAGIRDPCSTPLAKPAAGGAENLSFGKQPG 

LETNILKMTrPNKTPPGADPKQLERTGTVREI 

GSQAVWSLSSCKPGFGVDQLRDDNLETYWQ 

SIXJSQPHLVMQFRKKTTVICrLCIYADYKSDE 

SYTPSKISVRVGNNFHNLQBIRQLELVEPSGW 

IHVH.TDNHKKPTRTFMIQIAVLANHQNGRD 

THMRQKIYTPVEESSIOKFPRCTTIDFMMYRS 

IR 


954 


2304 


A 


8133 


66 


1015 


PPLH>RSFPNLFSRPEPLPEPGRRGCNRSREPA 

ARAPSPPPPFEGAPGRAMVKVTFNSALAQKE 

AKKDEPKSGEEALnPPDAVAVDCKDPDDW 

PVGQRRAWCWCMCFGLAFMLAGVILGGAY 

LYKYFALQPDDVYYCGDCYIKDDVILNEPSAD 

APAALYQnEENDOFE^VEFISVPVPEFADS 

DPANIVHDFNKKLTAYLDLNLDKCYVIPLNT 

SIVMPPRNLLELLINIKAGTYLPQSYUHEHMV 

FTORIENroHUJFFIYRLCHDKEryKLQRRETI 

KGIQKREASNCFA1RHFH«FAVETLICS 


955 


2305 


A 


8143 


35 


1171 


VESRSAWHEGEDQIDRLDFIRNQMNLLTU5V 

KXKlKEVTEEViWKVSCAMTDEICRLSVLVD 

EFCSEFHPNFDVUOYKSHJ^HIEDGMGRNL 

ADRCTDEVNALVLQTQQEIIENLKPLLPAGIQ 

DKLHTUPCXKFDLSYNLNYHKLCSDFQEDIV 

FRFSLOWSSLVHRFLGPKNAQRVLLGLSEPIF 

QLPRSLASrPTAPTTPATPDNASQEELMrrLVT 

GLASVTSRTSMGinVGGVIWKTIGWKLLSV& 

LTOYGALYLYERLSWTTHAKERAFKQQFVN 

YATEKLRMIVSSTSANCSHQVKQQIATTFARL 

CQQVDITQKQLEEEIARLPKEIDQLEKIQNNS 

KLLRNKAVQLENELENFTKQFLPSSNEES 


956 


2306 


A 


8157 


1854 


798 


ASGSPAPSSSSAMAAACGPGAAGYCLLLGLH 
IJLLTAGPALGWhDDPDRMLLRDVKALTLHY 
WO 01/57188

PCT/US01/03800

SEQ ID NO: of nucleotide sequence

SEQ ID NO: of peptide sequence

Method

SEQ ID NO: in USSN 09/496914

Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence

Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence

Amino acid sequence (A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine, K=Lysine, L=Leucine, M=Methionine, N=Asparagine, P=Proline, Q=Glutamine, R=Arginine, S=Serine, T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop codon, /=possible nucleotide deletion, \=possible nucleotide insertion)

DR\i"lSRRLDPIPQLKCVGGTAGCDSYTPKVI 

QCQNKGWDGYDVQWECKTDLDIAYKFGKT 

WSCEGYESSEDQYYLRGSCGLEYNLDYTEL 

GLQKLKESGKQHGFASFSDYYYKWSSADSC 

NMSOLITTVVLLGIAFVVYKLFLSIXjQYSPPP 

YSEYPPFSHRYQRFTNSAGPPPPGFKSEFTGPQ 

NTGHGATSGFGSAFTGQQGYENSGPGFWTGL 

GTGGILGYI-FGSNRAATPFSDSWYYPSYPPSY 

PGTWNRAYSPLHGGSGSYSVCSNSDTKmTA 

SGYGGTRRR 


957 


2307 


A 


8159 


1492 


528 


THWMTOMCYAPHQVLSYINGVTTSKPGVSL 

VYSMPSR^JLSLRLEGLQEKDSGPYSCSVNVQ 

DKQGKSRGHSnCTLELNVLVPPAPPSCRLQGV 

PHVGANVTLSCQSPRSKPAVQYQWDRQLPSP 

QTFFAPALDVIRGSLSLTNLSSSMAGVYVCKA 

HNEVGTAQCNVTLEVSTOPGAAWAGAWO 

TLVQLQLLAGLVLLYHRRGKALEEPANDIKE 

DAIAPRTUPWPKSSDTISKNGTLSSVTSARAL 

RPPHGPPRPGALTPTPSLSSQALPSPRLPTTDO 

AHPQPISPIPGGVSSSGLSRMGAVPVMVPAQS 

QAGSLV 


958 


2308 


A 


8161 


2340 


1192 


ELARRPKQQSSEKSRNMIRNWLTIFILFPLKLV 

EKCESSVSLTVHPWKLENGSSTNVSLTLRPP 

LNATLVrrFEITFRSKNrriLELPDEVVVPPGVT 

NSSFQVTSQNVGQLTVYLHGNHSNQTGPRK. 

FLVIRSSAISIINQVIGWIYFVAWSISFYPQVIM 

NWRRKSVIGLSFDFVALNLTGFVAYSVFNIGL 

LWVPYIKEQFLLKYPNOVNPVNSNDVFFSLH 

AWLTLraVQCCLYERGGQRVSWPAIGFLVL 

AWIJAFVTMrVAAVGVrrWLQFLFCrSYIKL 

AVTLVKYFPQAYMNFYYKSTEGWSIGNVLL 

DFTGGSFSLLQMFLQSYNNDQWTUFGDPTC 

FGLGVFSIVFDWFFIQHFCLYIUCRPGYDQLN 


959 


2309 


A 


8163 


521 


1345 


GERAGRRRGRLGVWAQPQPLLPRPVGSRRE 

MQPPGPPPAYAPTNGDFTFVSSADAEDLSGSl 

ASPDVKLNLGGDFUCESTATTFLRQRGYGWL 

LEVEDDDPEDNKPLLEELDIDLKDIYYKIRCV 

LMPMPSLGFNRQWRDNPDFWGPLAWLFFS 

MISLYGQFRWSWirnWIFGSLTIFLLARVLG 

GEVAYGQVLGVIOYSLLPLIVIAPVLLVVGSF 

EVVSTLIKLFGVFWAAYSAASLLVGEEFKTK 

KPLLIYPIFLLYIYFLSLYTGV 


960 


2310 


A 


8167 


1 


2921 


mtcfkgqkgeqrshafeankdHkakvpspn 
lysqlnalqftvdersilwlnqflldlkqsl 
nqfmavyklndnsksdehvdvrvdglmlk 
fvipsevksechqdqpraisiqssemiatntrh 

CPNamSDLEAlfQDFKDCDFFSKTYTSFPKS 
CDNFNLLHPIFQRHAHEQDTKMHEIYKGNrrp 

qlnkntlktsaatdvwavyfsqfwidyegm 
ksgkgrpisfvdsfplsiwicqptryaesqkep 
qtcnqvslntsqsessdlagrlkrkkllkey 
ystesepltnggqkpsssdtffrfspssseadi 

lll^V JlVrlR-rlV?>iVltjlI>lhl YCjY 1 .1.1 .1 .hl^HKNI 

senlrkdveavtgspasqtsicigillrsaela. 

lllhpvdqantlkspvsesvspwpdylpran 

gdflsskrkqisrdinrirsvtvnhmsdnrsm 

svdlsmplkdpllfksasdtnlqkgisfmdy 

lsdkhlgkisedessglvyksgsgeigsetsd 

kkdsfytdsssvlnyredsnilsfdsdgnqni 

lssn,tskgnetiesifkaedlu>eaaslsenl 

DISKEETPPVRTLKSQSSLSGKPKERCPPNLAP 

U:VSYKNMKRSSSQMSIJDTISLDSMILEEQLL 

ESDGSDSHMFLEKGNKKNSTTNYRGTAESVN 

AGANLQNYGETSPDAISTNSEGAQENHDDLM 

SVWFKITGVNGEIDIRQEDTEICLQVNQVTP 

DQLGNISLRHYLCNRPVGSDQKAVIHSKSSPE 

ISLRFESGPGAVIHSLLAEKNGFLQCHIENFST 

EFLTSSLMNIQHFLEDETVATVMPMKIQVSNT 

KlNLKDDSPRSSrVSLEPAPVTVHIDHLWER 

SDDGSFHIRDSHMLNTG>©LKENVKSDSVLL 

TSOKYDLKKQRSVTQATQTSPGVPWPSQSAN 

FPHE'SFDFTREQLMEENESLKQELAKAKMAL 

AEAHLEJCDALLHHIKKMTVE 


961 


2311 


A 


8172 


1442 


682 


TAAMSIFlPlNQUlLTNVAWRMKRACjKRFEl 
ACmjKWGWRSGVEKDLDEVLQTHSVFVN 
VSKGQVAKKEDLISAFOTDDQTEICKQILTKO 
EVQVSDKERHTQLEQMFRDIATIVADKCVNP 
ETKRPYTVELIERAMKDIHySVKTNKSTKQQA 
LEVKQLKEKMKIERAHMRLRFILPVNEGKKL 
KEKLKPLIKVIESEDYGQQLEIVCXIDPGCFREI 
DELIKKETKGKGSLEVLNLKDVEEGDEKFE 


962 


2312 


A 


8175 


286 


587 


haS>JKAEVSSHPSVISHSMDSFGQPRPEDNQS 
VLRRMQKKYWKTKQVFIKATGKKEDEHLVA 
SDAELDAKLEVFHSVQETCTELLKUEKYQLR 
LNGMKS 


963 


2313 


A 


8181 


13 


2215 


AEGCAERRGTEPWELSMSWESGAGPGLGSQ 

GMDLVWSAWYGKCVKGKGSLPLSAHGIW 

AWLSRAEWDQVTVYLFCDDHKLQRYALNRI 

TVWRSRSGNELPLAVASTADLIRCaCLLDVTG 

GLGTDELRLLYGMALVRFVNLISERKTKFAK 

VPLKCLAQEVNIPDWIVDLRHELTHKKMPHI 

NDCRRGCYFVLDWLQKTYWCRQLENSLRET 

WELEEFREGIEEEDQEEDKNIWDDITEQKPE 

PQDDGKSTESDVKADGDSKGSEEVDSHCKK 

ALSHKELYERARELLVSYl^EQFTVLEKFRYL 

PKAKAWNNPSPRVECVLAELXGVTCENREA 

VLDAFLDDGFLVPTFEQLAALQIEYEENVDL 

NDVLVPKPFSQFWQPLLRGLHSQNFTQALLE 

RMLSELPALGlSGIRPTYlLRWTVELIVANnCT 

GRNARRFSAGQWEARRGWRLFNCSASLDWP 

RMVESCLGSPCWASPQLLRIIFKAMGQGLPD 

EEQEKLLRICSIYTQSGENSLVQEGSEASPIGK 

SPYTLDSLYWSVKPASSSFGSEAKAQQQEEQ 

GSVNDVKEEEKEEKEVLPDQVEEEEENDDQE 

EEEEDEDDEDDEEEDRMEVOPFSTOQESPTA 

ENARIXAQKRGALQOSAWQVSSEDVRWDTF 

PLGRMPGQTra>PAELJ»ttJBNYDTMYLUX3PV 

LEQRLEPSTCKTDTLGLSCGVGSGNCSNSSSS 

NFEGLLWSQGQIilGLKTGLQLF 


964 


2314 


A 


8184 


6 


1393 


EPRRNFRDDSTRPRTRGRTRGRRRRACRSAE 

GTOLRSLLLPPRLQLPAGPFSRCRWDPVSSPR 

PSTMPPKKGQDGIKPPPnGRFGTSLiaGIVGLP 

NVGKSTFFNVLTNSQASAENFPFCTIDPNESR 

VPVPDERFDFLCQYHKPASKIPAFLNWDIAG 

LVKGAHNGQGLGNAFLSHISACDGIFHLTRA 

FEDDDITHVEGSVDPIRDIEIIHEELQLKDEEMr 

GPIIDKLEKVAVRGGDKKLKPEYDIMCKVKS 

WVIDQKKPVRFYHDWNIKEIEVLNKHLFLTS 

KPMVYLVNLSEKDYIRKKNKWLIKIKEWVD 

KYDPGALVIPFSGALELKLQELSAEERQKYLB 



ANMTQSAUPKIIKAGFAALQLEYFFTAGPDEV 
RAWTTRKGTKAPQAAGKIHTDFEKGFIMAEV 
MKYEDFKEEGSENAVKAAGKYRQQGKNYIV 
EDGDIIFFKFNTPQQPKKK 


965 


2315 


A 


8195 


1437 


594 


RSFSLSFSLLSPSHylMALGAAGATRVFVAMV 

AAALUUHPIXGVSATLNSVIJSSNADCNLPPPL 

GGAAGHPGSAVSAAPGILYPGGNKYQTIDNY 

QPYPCAEDEECOTDBYCASPTRGGDAGVQIC 

IACRKRRKRCMRHAMCC3>GNYC3CNGICVSS 

DQNHFRGEffiETTTESFONDHSTLDOYSRRTT 

LSSKMYHTKGQEGSVCLRSSDCASGLCCARH 

FWSKICKPVLKEGQVCTKHRRK,GSHGLEIFQ 

RCYCGEGLSOUQKDHHQASNSSRLHTCQRH 


966 


2316 


A 


8207 


416 


4082 


KFKLIKlMLLTLIILIi>WSKFSFVSLSAPQHW 

SCPEGTLAGNGNSTCVGPAPFUFSHGNSIFRI 

DTEGTNYEQLWDAGVSVIMDFHYNEKRIY 

WVDLERQLLQRVFLNGSRQERVCNIEKNVSG 

MAINWINEEVlWSNQQEGirrVTOMKGNNSHI 

LLSALKYPANVAVDPVERFIFWSSEVAGSLY 

RADLDGVGVKALLETSEKTTAVSLDVLDKRL 

FWIQYNREOSNSLICSCDYDGGSVHISKHPTQ 

HNIJ'AMSLFGDlUFYSTWKMKTIWlANKifrG 

KDMVRINLHSSFVPLGELKWHPLAQPKAED 

DTWEPEQKLCKLRKGNCSSTVCGQDLQSHLC 

MCAEGYALSRDRKYCEGNDWKYCEDVNEC 

AFWNHGCTLGCKNTPGSYYCTCPVGFVLLPD 

GKRCHQLVSCPRNVSECSHDCVLTSEGPLCF 

CPEGSVLERDGKTCSGCSSPDNGGCSQLCVPL 

SPVSWECDCFPGYDLQLDEKSCAASGPQPFL 

LFANSQDIRHMHFDGTDYGTLLSQQMGMVY 

ALDHDPVENKIYFAHTALKWIERANMDGSQ 

RHILIEEGVDVPEGLAVDWIGRRFYWTDRGK 

SLIGRSDLNGKRSKUTIENISQPRGIAVHPMAK 

RLFWTDTGINPRIESSSLQGLGRLVIASSDLIW 

PSGITIDFLTDKLYWCDAKQSVIEMANUXjSK 

RRRLTQNDVGHPFAVAVFEDYVWFSDWAMP 

SVIRVNKRTGKDRVRLQGSMLKPSSLVWHP 

LAKPGADPCLYQNGGCEHICKKRLGTAWCS 

CTffiGFNOCASDGKTCLALDGHQLLAGGEVDL 

KNQVTPLDILSKTRVSEDNITESQHMLVAEIM 

VSIXJODCAPYGCSMYARaSEGEDATCQCLK 

GFAGDGKLCSDIDECEMGVPVC3TASSKCINT 

EGGYVCRCSEGYQGDGIHCLDIDECXJLGVHS 

CGENASCTNTEGGYTCMCAGRLSEPGLICPD 

STfPPHLREDDHHYSVRNSDSECPLSHDGYCL 

JnLlAjVlJVlilliAJJUK.YACrlCVVGYIGnR(jQYR 

DLKWWELRHAGHGQQQKVrWAVCVWLV 

MLLLLSLWGAHYYRTQKLLSKNPKNPYEESS 

RDVRSRRPADTEDGMSSCPQPWFWIKEHQD 

LKNGGQPVAGEDOQAADGSMQPTSWRQEPQ 

LCOMGTEQGCWIPVSSDKGSCPQVMERSFH 

MPSYOTQTLEGOVEKPHSLLSANPLWQQRAL 

DPPHQMELTQ 


967 


2317 


A 


8210 


3 


601 


SSAMGSRSSHAAVIPDGDSIRRETGFSQASLL 

RLHHRFRALDRNKKGYLSRMDLQQIGALAV 

NPLGDRimSFFPDGSQRVDFPGFVRVLAHFRP 

VEDEDTETQDPKKPEPLNSRRNKLHYAFQLY 

DLDRDQKISRHEMLQVLRLMVGVQVTEEQL 

ENIADRTVQEADEDGDGAVSFVEFTKSLEKM 

DVEHKMSQULK 


968 


2318


A 


8211 


2 


409 


ISSCPHTAYEGSMSTLSNFTQTLEDAmUUFIT 

YMDNWRQNTTAEQEALQAKVDAENFYTVIL 

YLMyMTGMFSFUVAILVSTVKSKRREHSNDP 

YHQYIVEDWQEKYKSQILNLffiSKATlHENlG 

AAGFKMSP 


969 


2319 


A 


8213 


1 


1938 


GMPRSRGGRAAPGPPPPPPPPGQAPRWSRWR 

VPGRIXLLIXPALCCLPGAARAAAAAAGAGN 

RAAVAVAVARADEAEAPFAGQNWIKSYGY 

LLPYDSRASALHSAKALQSAVSTMQQFYGIP 

VTGVIXKJTTIEWMKKPRCGVPDHPHLSRRRR 

NKRYALTGQKWRQKHTTYSIHNYrFKVOELD 

TRKAIRQAFDVWQKVTPLTFEEVPYHEIKSDR 

KEADIMDFFASGFHGDSSPEDGEGGFLAHAYF 

PGPGIGGDTHFDSDEPWTLGNAhlHDGNDLFL 

VAVHELGHALGLEHSSDPSAIMAPFYQYMET 

HNFKLPQDDLQGIQKIYGPPAEPLEPTRPLPTL 

PVRRIHSPSmaCHERQPRPPRPPLGDRPSTPGT 

KPNICDGNFNTVALFRGEMFVFKDRWFVm- 

RNNRVQEGYPMQIEQFWKGLPARIDAAYER 

ADGRFVFFK(a)KYWVFKEVTVH>OYPHSLG 

ELGSCLPRBGIDTAI-RWEPVGKTYFFKGERY 

WRYSEERRATDPGYPKPrrVWKGIPQAPQOA 

FISKEGYYTYFYKORDYWKFDNQKLSVEPGY 

PRNILRDWMGCNQKEVERRKERRLPQDDVDl 

MVTINDVPGSVNAVAVVn<aLSLCILVLVYTI 

FQFKNKTGPQPVTYYKRPVQEWV 


970 


2320 


A 


8216 


1235 


2223 


SRI^QFYVSFRRTGUTCKLIVEIFFRNYMN 

DSLRTNVFVRFQPETIACACIYLAARALQIPLP 

TRPHWFLLFGTTEEEIQEICIETLRLYTRKKPN 

YELLEKEVEKRKVALQEAKLKAKGLNPDGTP 

ALSTLGGFSPASKPSSPREVKAEEKSPISINVK 

TVKKEPEDRQQASKSPYNGVRKDSKRSRNSR 

SASRSRSRTRSRSRSHTPRRHYNNRRSRSGTY 

SSRSRSRSRSHSESPRRHHNHGSPHLKAKHTR 

DDLKSSNRHGHKRKKSRSRSQSKSRDHSDAA 

KKHRHERGHHRDRRERSRSFERSHKSKHHGG 

SRSGHGRHRR 


971 


2321 


A 


8217 


3 


3274 


DCRLQAAMPINKI-VVPVEAHADGGGDETAE 

RTEAPGTPEGPEPEEU'SPGDGNPRENSPFLNN 

VEVEQESFFEGKNMALFffiEMDSNPMVSSLL 

NKLANYTNLSQGWEHEEDEESRRREAKAPR 

MGTFIGVYLPCLQNILGVlLFLRLTWrVGVAG 

VLESFLIYAMCCTCTMLTAISMSAIATNGWP 

AGGSYYMISRSLGPEFGGAVGLCFYLGTTFA 

GAMYILGITEIFLTYISPGAAIFQAEAAGGEAA 

AMLHNMRVYOTCTLVIJvlALVVFVGVKYVN 

KLALVFLACWLSILAIYAGVIKSAFDPPDIPV 

CLLGNRTI^RRSFDACVKAYGIHNNSATSAL

WGLFCa^GSQPSAACDEYnQNNVTEIQGIPGA 

ASGVFLENLWSTYAHAGAFVEKKGVPSVPV 

AEESRASTLPYVLTDIAASFTLLVOIYFPSVTG 

IMAOSNRSGDLKDAQKSIPTGTILAIVTTSFIY 

LSCmFGAClEGWLRDKFGEALQGNLVIGM 

LAWPSPWVmGSFFSTCOAGLQTLTGAPRLL 

QAIARDGIVPFLQVFGHGKANGEPTWALLLT 

VLICETGIUASUJSVAPILSMFFLMCYLFVNL 

ACAVQrilLRTlWVRPRFKFYHWTl.SFLOMSL 

CLALMFICSWYYALSAMLIAGCIYKYIEYRG 

AEKEWGDGIRGLSLNAARYAIXRVEHGPPHT 

KNWRPQVLVMLNLDAEQAMKHPRIXSFTSQ 

LKAGKGLTIVGSVLEGTYLDKHMEAQRAEE 

NIRSLMSTEKTKOFCQLWSSSLRDGMSHLIQ 

SAGLGGLKHNTVLMAWPASWKQEDNPFSW 

KNFVDTVRDTTAAHQALLVAKNVDSFPQNQ 

ERFGGGHIDVWWIVHDOGMLMLLPFLLRQH 

KATWRKORMRffTVAQVDDNSIQMKKDLQMF 

LYHLRISAEVEVVEMVENDISAFTYERTLNIM 

EQRSQMLKQMQLSKNEQEREAQLIHDRNTAS 

HTAAAARTQAPPTPDKVQMTWTREKLIAEK 

YRSRDTSLSGFKDLFSMKPDQSNVRRMHTAV 

KLNGWLNKSQDAQLVLLNMPGPPKNRQOD 

ENYMEFLEVLTEGLNRVLLVRGGGREVmYS 


972 


2322 


A 


8224 


701 


246 


TSMIVTMKFNPFVTSDRSKNRKRHFNAPSHV

RRKIMSSPLSKELRQKYNVRSMFIRKDDEVQ 

WRGHYKGQQIGKWQVYRKKYVIYIERVQ 

REKANGTTVHVGIHPSKVVrrRLKLDKDRKKI 

LERKAKSRQVGKEKGKYKBELIEKMQE 


973 


2323 


A 


8237 


873 


4610 


GCPHAGGKGRVPTGGLTGGRTWSPSAAPRSC 

PRPGPTPAPGAMDKLPPSMRKRLYSLPQQVG 

AKAWIMDEEEDAEEEGAGGRQDPSRRSnUJR 

PLPSPSPSAAAGGTESRSSALGAADSEGPARG 

AGKSSTNGDOIRFRGSLASLOSRGOGSGGTG 

SGSSHGHLHDSAEERRLIAEGDASPGEDRTPP 

GLAAEPERPGASAQPAASPPPPQQPPQPASAS 

CEQPSVDTAKVEGGAAAGDQILPEAEVRLG 

QAGFMQRQFGAMLQPGVNKFSLRMFGSQKA 

VEREQERVKSAGFWnHPYSDFRFYWDLTML 

LmVGNLmPVGriTFXDENTTPWIVFNVVSD 

TFFUDLVLNFRTOIWEDNTEIILDPQRIKMK 

YUCSWFMVDFISSIPVDYIFLIVETRIDSEVyK 

TARAIJUVRFTKILSIXRIXRLSRLIRYIHQWE 

EJFHNfTYDLASAVVRIVNlJGMMLLLCHWDG 

CLQFLVHvflLQDFPDDCWVSINNMVNNSWGK 

QYSYALFKAMSHMLCIQYGRQAPVGMSDV 

WLTMLSMIVGATCYAMnGHATALIQSLDSS 

RRQYQEKYKQVEQYMSFHKLPPDTRQRIHD 

YYEHRYQGKMFD^ILGELSEPLREEIINFNC 

RKLVASMPIJANADPNFVTSMLTKLRFEVFQ 

PGDYnKEGTKaaCMYHQHGWSVLTKGNKE 

IXLADGSYFGBICLLTRGRRTASVRADTYCR 

LYSLSVDNFNEVLEEYPMMRRAFETVALDRL 

DRIGKKNSnXHKVQHDLNSGVFNYQENEnQ 

QIVQHDREMAHCAHRVQAAASATPTPTPVIW 

TPLIQAPLQAAAATTSVAIALTHHPRLPAAIFR 

PPPGSGLGNLGAGQTPRHLKRLQSLIPSALGS 

ASPASSPSQVDTPSSSSFHIQQLAOFSAPAGLS 

PLLPSSSSSPPPGACGSPSAPTPSAGVAATTIA 

GFGHFHKALGGSLSSSDSPLLTPLQPGARSPQ 

AAQPSPAPPGARGGLGLPEHFLPPPPSSRSPSS 

SPGQLGQPPGELSLGLATGPLSTPBTPPRQPEP 

PSLVAGASGGASPVGFTPRGGLSPPGHSPGPP 

RTFPSAPPRASGSHGSUXPPASSPPPPQVPQR 

RGTPPLTPGRI TODI KLTSA!?OPAT POTy?AnT 

LRRASPHSSGESMAAFPLFPRAGGGSGGSOSS 
GGLGPPGRPYGAIPGQHVTLPRKTSSGSLPPP 
LSLFGARATSSGOPPLTAGPQREPGARPEPVR 
SKLPSNL 


974 


2324 


A 


8247 


279 


468 


EYKQWBRRFLSCQNRNDLGYOKPRKGGGLL 
LVPVKDASRICSLTYIXGSHWNNLWRSPVL 
G 



975 


2325 


A 


8249 


62 


1571 


LVAUaWKPKGTNIPAPQSPVFGEAVSGVYM 

MTKVLGMAPVLGPRPPQEQVGPLMVKVEEK 

BEKGKYLPSLEMFRQRFRQFGYHDTPGPREA 

LSQLRVLCCEWLRPEDHTKEQILELLVUBQFLT 

ELPQELQAWVQEHCPESABEAVnXEDLEREL 

DEPGHQVSTPPNEQKPVWEKISSSGTAKESPS 

SMQPQPLETSHKYESWGPLYIQESGEEQEFAQ 

DPRKVRDCRLSTQHEESADEQKGSEAEGLKG 

DnsvnANKPEASLERQCVNLENEKGTKPPLQ 

EAGSKKGRESVPTKPTPGERRYICAECGKAFS 

NSSNLTKHRRTHTGEKPYVCTKCGKAFSHSS 

NLTLHYRTHLVDRPYDCKCGKAFGQSSDIXK 

HQRMHTEEAPYQCKDCGKAFSGKGSURHYR 

IHTGEKPYQCNECGKSFSQHAGLSSHQRLirr 

GEKPYKCKECGKAFNHSSNFNKHHRIHTGEK 

PYWCHHCGKTFCSKSNLSKHQRVHTGEOEA 

P 


976 


2326 


A 


8257 


298 


7086 


GNMACWPQLRLLLWKNLTFRKRQTCQIXLE 

VAWPLFIFLIUSVRLSYPPYEQHECHFPNKAM 

PSAOTLPWVQGnCNANNPCFRYPTPGEAPGV 

VGNFNKSrVARLFSDARRUXYSQKDTSMKD 

MRKVLRTLQQIKKSSSNLKLQDFLVDNETFS 

OFLYHNLSLPKSTVDKMLRADVILHKVFLQO 

YQLHLTSLCNGSKSEEMIQLGDQEVSELCGLP 

REKLAAAERVLRSNMDILKPILRTLNSTSPFPS 

KELAEATKTLLHSLGTLAQELFSMRSWSDMR 

QEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEG 

GGLKIKSLNWYEDNNYKALFOGNOTEEDAE 

TFYDNSTTPYCNDLMKNLESSPLSRnWKALK 

PIXVGKILYTPDTPATRQVMAEVNKTFQELA 

VFHDLEGMWEELSPKTWTFMENSQEMDLVR 

MLLDSRDNDHFWEQQUX}LDWTAQDIVAFL 

AKHPEDVQSSNGSVYTWREAFNETNQAIRTIS 

RFMECVNLNKLEPIATEVWLINKSMELLDER 

KFWAGIVFTGITPGSIELPHHVKYKIRMOIDN 

VERTNKIKDGYWDPGPRADPFEDMRYVWGQ 

FAYLQDWEQAIIRVLTGTEKKTGVYMQQMP 

YPCYVDDIFLRVMSRSMPLFMTLAWIYSVAV 

nKGIVYEKEARLKETMRIMGLDNSILWFSWFI 

SSLIPLLVSAGLLWILKLGNLU'YSDPSWFV 

FLSVFAWTILQCFUSTLFSRANLAAACGOn 

YFTLYLPYVLCVAWQDYVGFTLKIFASLLSP 

VAFGFGCEYFALFEEQGIGVQWDNLFESFVE 

EIXJFNLTrSVSMMLFiyrFLYGVMTWYIEAVF 

PGQYGIPRPWYEPCrKSYWFGEESDEKSHPGS 

NQKRISEICMEEEPTHLKLGVSIQNLVKVYRD 

OMKVAVDOLALNFYEGQITSFLGHNOAGKT 

TTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQ 

NLGVCKJHNVLFDMLTVEEHIWFYARLKGLS 

EKHVKAEMEQMALDVGLPSSKLKSKTSQLS 

GGMQRKLSVALAFVGGSKWILDBPTAGVDP 

YSRRGIWELLLKYRQGRTIILSraHMDEADVL 

QDRIAnSHOKLCCVGSSLFLKNQLGTGYYLT 

LVKKDVESSLSSCRNSSSrVSYLKKEDSVSQS 

SSDAGLGSDHESDTLTIDVSAISNLIRKHVSEA 

RLVEDIGHELTYVLPYEAAKBGAFVELFHBID 

DRLSDLGISSYGISETTLEEIFLKVAEESGVDA 

ETSDGTLPARRNRRAFODKQSCLRPFTEEMDA 

ADPNDSDIDPESRETDLLSGMDGKGSYQVKG 

WKLTQQQFVAU-WKRLLIARRSRKGFFAQIV 

LPAVFVCIALVFSUVPPFGKYPSLELQPWMY 

NEQYTFVSNDAPEDTGTLELLNALTKDPGFO 

TRCMEGNPIPDTPCQAGEEEWTTAPVPQTIM 

DLFQNGNWTMQNPSPACQCSSDKIKKMLPV 

CPPGAGGLPPPQRKQNTADILQDLTORNISDY 

LVKTYVQIIAKSLKNKIWVNEFRYOOFSLGVS 

NTQAUTSQEVNDATKQNIKKHLKLAKDSSA 

Dmj^SLGRFMTGLDTRNNVKVAVFNNKGW 

HAISSFLNVINNAILRANLQKGH«IPSHYGrrAF 

NHPLNLTKQQLSEVAPMTTSVDVLVSICVIFA 

MSFVPASFWFUQERVSKAKHLQFISGVKPVl 

YVa-SNFVWDMCNYVVPATLVniFICTQQKSY 

VSSTNIJPVLALLLLLYGWSITPLMYPASFVFK 

IPSTAYWLTSVNLFIGINGSVATFVLELFTDN 

KLhmiNDILKSVFLIFPHFCLGRGLIDMVKNQ 

AMADALERFGENKFVSPLSWDLVGRNLFAM 

AVEGVVFFUTVLIQYRFFIRPRFVNAKLSPLN 

DEDEDVRRERQRILDGGGQNDILEKELTKIY 

RRKRKPAVDRICVGIPPGECFGLLGVNGAGK 

SSTI^MLTGDTTVTRGDAFLNRNSILSNIHEV 

HQNMGYCPQFDAITELLTOREHVEFFALLRG 

VFEKEVGRVGEWAmKLOLVKYGEKYAGNY 

SGGNKRKLSTAMALIGGPFWFLDEPTTGMD 

PKARRFLWNCALSWKEGRSWLTSHSMEEC 

EALCTRMAIMVNGRFRCLGSVQHLKNRFGD 

GYTTVVRIAGSNPDLKPVQDFFGLAFPGSVPK 

EKHRNMLQYQLPSSLSSLARIFSILSQSKKRLH 

DEDYSVSQTTLDQVFVNFAKDQSDDDHLKDL 

SLHKNQTWDVAVLTSFLQDEKVKESYV 


977 


2327 


A 


8260 


3 


1567 


IPGSnSFSLCFlFPPCVPTMVRKPVVSnSKGO 

YLQGNVNGRLPSLGNKEPPGQEKVQLKRKV 

TLLRGVSraGTHGAGIFISPKGVLQNTGSVGM 

SLnWTVCGVLSLFGALSYAELGTTIKKSGGH 

YTYILEVFGPLPAFVRVWVELLIIRPAATAVIS 

LAFGRYILEPFFIQCEIPELAIKLITAVGITWM 

VLNSMSVSWSARIQlFLTFCKLTAILmVPGV 

MQLKGQTQNFKDAFSGRDSSITRLPLAFYYG 

MYAYAGWFYLNFVTEEVENPEKTIPLAICISM 

AIVnGYVLTNYAYFTTINAEELLLSNAVAVT 

FSERLLGNFSLAVPIFVALSCFGSMNGGVFAV 

SRLFYVASREOHLPEDLSMIHVRICHTPLPAVIV 

LHPLTMIMLFSGDLDSLLNFLSFARWLFIGLA 

VAGLIYLRYKCPDMHRPFKVPLFIPALFSFrC 

LFMVALSLYSDPFSTGIGFVITLTGVFAYYLFn 

WDKKPRWFRIMSEKrrRILQnLEWPEHDKL 


978 


2328 


A 


8261 


2 


216S 


RGGSLRCVLGKLLGQLLCFQSERCVRFPEGLL 

RHRGCOLLSSRLSAGKPPLRTSFFQSWGVLPP 

LADAASMSGVRAVRISIESACEKQVHEVGLD 

GTETYLPPLSMSQNLARLAQRIDFSQGSGSBS 

EEAAGTEGDAQEWPGAGSSADQDDEEGWK 

FQPSLWPWDSVRNNLRSALTEMCVLYDVLSI 

VRDKKFMTLDPVSQDALPPKQNPQTXQLISK 

JIUUu^uAAQiLiJvCiAllKLl VI KNQENKI^ 

RDFNSEIXRLRQHWKLRKVGDKILGDLSYRS 

AGSLFPHHGTFEVDCNTDLDLDKKIPEDYCPL 

DVQIPSDLEGSAYDCVSIQKQAPDIGDLGTVN 

LHaU>LPKSKPGSPHWQTia.EAAQNVIXCKEI 

FAQLSREAVQKSQVPmWKNQnSQPFPSLQ 

LSISLCHSSNDKKSQKFATEKQCPEDHLYVLE 

HhOJnXIREFHKQfn-SSIMMPHPASAPFGHKR 

MRLSQPQAFDKNEINSLQSSEOLLEKIIKQAK 

HIFLRSRAAATIDSLASRIEDPQIQAHWSNIND 

VYESSVK-VLITSQGYEQICKSIQLQLNIGVEQI 

RWHRDGRVTTLSYQEQELQDFLLSQMSQHQ 

VHAVQQLAKVMGWQVLSFSNHVGLGPIESIG 

NASATTVASPSGDYAISVKNGPESGSKIMVQF 

PRNQCKDLPKSDVLQDNKWSHLRGPFKEVQ 

WNKMEGRNFVYKMELLMSALSPCLL 


979 


2329 


A 


8289 


2 


1053 


FVWNPRGGRKRRRQAAVTQAATRASGTPSP 

RDGTMTQGKLSVANKAPGTCGQQQVHGEKK 

EAPAVPSAPPSYEEATSGEGMKAGAFPPAPTA 

VPLHPSWAYVDPSSSSSYDNGFPTODHELFTr 

FSWDDQKVRRVFVRKVYTILLIQLLVTLAW 

ALFTFCDPVKDYVQANPGWYWASYAVFFAT 

YLTIACCSGPRRHFFVWUILLTVFrLSMAYLT 

GMLSSYYNTTSVLLCLGITALVCLSVTVFSFQ 

TKFDFTSCQGVLFVIXMmTSGLILAILLPFQ 

YVPWLHAVYAAIXJAGVFTLFLALDTQLLMG 

NRRHSLSPEEYIFGALMYLDnYIFrFFLQLFG 

TNRE 


980 


2330 


A 


8305 


59 


857 


ASQLPDYSISPPSLPPRISFHPSPTLARVAMAEP

SEATQSHSISSSSFGAEPSAPGGGGSPGACPAL 

GTKSCSSSCAVHDUFWRDVKKTGFVFGTTLI 

MLLSLAAFSVISWSYLILALLSVnSFRrYKSV 

IQAVQKSHEGHPFKAYLDVDITLSSEAFHW 

MNAAMVHINRALKLIIRLFLVEDLVDSLKLA 

VFMWLMTYVGAVFNGITLLILAELLIFSVPIV 

YEKYKTQEDHYVGIARDQTKSIVEKIQAKLPG 

lAKKKAE 


981 


2331 


A 


8308 


186 


1337 


TRMSRHEGVSCDACLKGNFRGRRYKCLICYD 

YDLCASCYESGATTraHTTDHPMQCILTRVD 

FDLYYGGEAFSVEQPQSFrCPYCX3KMGYTET 

SLQEHVTSEHAETSTEVICPICAALPGGDPNH 

VTDDFAAHLTLEHRAPRDLDESSGVRHVRR 

MFHPGRGLGGPRARRSNMHFTSSSTGGLSSS 

QSSYSPSNREAMDPIAELLSQLSGVRRSAGGQ 

LNSSGPSASQLQQLQMQLQLSIQHAQAARQ 

QLETARNATRRThn^SVTTriTQSTATTNIAN 

TESSQQTLQNSQFLLTRLNDPKMSETERQSM 

ESERADRSLFVQELLLSTLVREESSSSDEDDR 

GEMADFQAMGCVDIMPLDVALENLNLKESN 

KGNEPPPPPL 


982 


2332 


A 


831S 


1 


1004 


GSTHASADAWAQWFCIHALVMOAPVWYLV 

AAALLVGFDLFLTRSRGRAASAGQEPLHNEEL 

AGAGRVAQPGPLEPEEPRAGORPRRRRDLGS 

RLQAQRRAQRVAWAEADENEEEAVILAQEE 

EGVEKPAETHLSGKIGAKKLRKUEEKQARKA 

QREAEEAEREERXRLESQRHAEWKKEEERLR 

LEEEQKEEEERKAREEQAQREHEEYLKLKEA 

FWEEEGVGE™rrEE(^SFLTEFINYlKQSK 

WIXEDLASQVGLRTQDnNRIQDLLAEGTIT 

UVUJlJKuJU'IYlli'bbLAAVANFIRQRGRVSIA 

ELAQASNSLIAWGRESPAQAPA 


983 


2333 


A 


8320 


244 


1420 


RRRWRARGGLVPTLAWAEATGAYVPGRDKP 
DLPTWKROTRSALNRKBGLRLAEDRSKDPHD 
PHKIYEFVNSGVODFSQPDTSPDTNGGGSTSD 
TQEDILDELLGNMVLAPLPDPGPPSLAVAPEP 
CPQPLRSPSLDNPTPFPNLGPSENPLKRLLVPG 
EEWEFEVTAFYRGRQVFQQTISCPEGLRLVGS 

EVGDRTLPGWPVTLPDPOMSLTDRGVMSYV 

RHVLSCLGGGLALWRAGQWLWAQRLGHCH 

TYWAVSEEIXPNSOHGPDOEVPKDKEGOVF 

DLGPPrVGSLGPPDLriTTEGSGRSPRyALWFC 

VGESWPQDQPWTKRLVMVKVVPTCLRALVE 

MARVGGASSLENTVDLHISNSHPLSLTSDQY 

KAYLQDLVEGMDFQGPGBS 


984 


2334 


A 


8321 


1 


1243 


ANMAPVEHWADAGAFLRHAALQDIGKNIY 

TTREVVTEIRDKATRRR1AVIJ»YELRFKEPLPE 

YVRLVTEFSKKTODYPSLSATDIQVLALTYQL 

EAEFVGVSHLKQEPQKVKVSSSIQHPEITLHIS 

GFHLPYKPKPPQETEKGHSACEPENLEFSSFM 

FWRNPLPNIDHELQELLIDRGEDVPSEEEEEEE 

NOFEDRKDDSDDJDGGGWrrPSNnCQlQQELE 

QCDVPEDVRVGCLTTDFAMQNVLLQMGLHV 

lAVNGMLIREARSYILRCHGCFKTTSDMSRV 

FCSHCGNKTLKKVSVTVSDDGTLHMHFSRNP 

KVLNPRGLRYSLPTraCGGKYAINPHLTBDQRF 

PQLRLSQKARQKTNVFAPDYIAGVSPFVENDI 

SSRSATIXJVRDSTU3A0RRRLNPNASRKKFV 

KKR 


985 


2335 


A 


8322 


352 


529 


RKNNIRQFIMKVaSGQARWLTPWPVLWET 
EAGRSLELKSLRPAWATWGNPISTKINK 


986 


2336 


A 


8325 


89 


1172 


KMNPTDIADTTLDESIYSNYYLYESIPKPCTKE 

GKAFGELFLPPLYSLVFVFGLLGNSWVLVL 

FKYKRLRSMTDVYIXNLAISDLLFVFSLPFWG 

YYAADQWVFGLGLCKMISWMYLVGFySGIF 

FVMlMSIDRYLArVHAVFSLRARTLTYGVITS 

LATWSVAVFASIJKjFIJSTCYTERNHTYCKT 

KYSLNSTTWKVLSSLEINILGLVIPLGINlUXrY 

SMIIRTLQHCKNEKKNKAVKMIFAVVVLFLG 

PWlPYNiyLFI^TLVELBVLQDCTETOYLDYA 

IQATETLAFVHCXXNPnyFFLQEKFRKYILQL 

FKTCRGLFVLCQYCGLLQIYSADIPSSSYTQS 

TMDHDLHDAL 


987 


2337 


A 


8326 


3 


470 


SLSAMRFLAATFLLLALSTAAQAEPVQFKDC 
GSVDOVnCEVNVSPCPTQPCQLSKGQSYSVN 
VTFTSNIQSKSSKAWHGILMGVPVPFPIPEPD 
GCKSGINCPIQKDKTYSYLNKLPVKSEYPSIK 
LWEWQLQDDKNQSLFCWEIPVQIVSHL 


988 


2338 


A 


8335 


1205 


323 


vikmalaarujpqflhsrsij<;gavrlrtpa 

vaevrlpsatlcyfcrcrlgloaalfprsar 

alaasalpaqgsrwpvlsspglpaafasfpac 

pqrsysteekpqqhqktkmivlgfsnpinwv 

rtrkafuwayfdkefsitefsegakqafah 

vskllsqckfdllbelvakevlhalkbkvts 

lpdniknalaanideivftstgdisiyydekg 

rkfvnilaicfwyltsanipsetlrgasvfqvk 

lgnqnvetkqllsasyefqreftqgvkpdvnr 

iariehsklle 


989 


2339 


A 


8349 


67 


185 


MSGFMQIXIQNlJOrYHrRLKTSQGLCLLSL 
KSLHPMS 


990 


2340 


A 


8361 


210 


Ills 


aspflrpqohdsgerepfsqtpglmqpfsipvq 

itlqgsrrrqgrtafpasgkkretoysdgdpl 

dvhkrlpsstgedravmlgfammgfsvlk4f 

fixgtiukpfmlsiqreestctaihtdiniddw 

ldcaftcgvhchgqgkypclqvfvm^hpg 

qkallhyneeavqinpkc3fytpkchqdrndl 

lnsaldkeffdhkngtpfscfyspasqsbdvi 

LIKKYDQMAIFHCLFWPSLTLLGGAIJVGMV 
RLTQHLSLLCEKYSTWRDEVGGKVFYIEQH 
QFKLCIMRRSKGRAEKS 


991 


2341 


A 


8369 


9 


921 


SSWEFSALSVSMACLSPSQLQKFQQDGFLVL 

EGFLSAEECVAMCJQRIGErVAEMDVPLHCXT 

EFSTQEEEQLRAQGSTDYFLSSGDKIRFFFEK 

GVFDEKGNFLVPPEKSINKIGHALHAHDPVFK 

SrrHSFKVQTLARSLGLQMPVVVQSMYIFKQP 

HFGGEVSPHQDASFLYTEPLGRVLGVWIAVE 

DATLENGOLWFIPGSHTSGVSRRMVRAPVQS 

APGTSFLOSEPARDNSLPVPTPVQRGALVLIH 

GEWHKSKQNLSDRSRQAYTTHLMEASGTT 

WSPENWLQPTAELPFPQLYT 


992 


2342 


A 


8370 


906 


4 


MALSGNCSRYYPREQGSAVPNSFPEWELNV 

GGQVYFTRHSTLISIFHSLLWKMFSPKRDTAN 

DLAKDSKGRFFIDRDOFLFRYILDYLRDRQW 

LPDHFPEKGRLKREAEYFQLPDLVKLLTPDEI 

KQSPDEFCHSDFEDASQGSDTRICPPSSLLPAD 

RKWGFrrVGYRGSCTLGREGQADAKFRRVPR 

ILVCGRISLAKEVFGETLNESRDPDRAPERyTS 

RFYLKFKHLMGAPASNFILGFWGLGQNQDK 

HPVNIYLQQRSVIRPDLTSKKAODUCGKGDA 

QEVSRRRRWLGDPEHL 


993 


2343 


A 


8379 


■ 


2794 


MRMQRHKNDTMDFGDSGKRIGGGVLCLLHQ 

SNTSFIKLNNNGFEDIVIVIDPSVPEDEKnEQIE 

DMVTTASTYLFEATEKRFFFKNVSILIPENWK 

ENPQYKRPKHENHKHADVIVAPPTLPGRD)a» 

YTKQFTECGEKGEYIHFTPDLLLGKKQNEYG 

PPGKLFVHEWAHLRWGVFDEYNEDQPFYRA 

KSKKIEATRCSAGISGRNRVYKCQGGSCLSRA 

CRIDSTTKLYGKDCQFFPDKVQTKASIMEM 

QSIDSWHOJEKTHNQEAPSLQNIKCNFRST 

WEVISNSEDFKNTIPMVTPPPPPVFSLLKIRQRI 

VCLVLDKSGSMGGKDRLNRMNQAAKHFIXQ 

TVENGSVA^GMVHFDSTATIVNKLIQIKSSDER 

NTLMAGLPTYPLGGTSlCSGnCYAFQVIGELH 

SQUXJSEVLLLTDGEDNTASSODEVKQSGAI 

VHFlALGRAADEAVIEMSKrrGGSHFYVSDEA 

QNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 

LNSNAWMNDTVnDSTVGKDTFFLITWNSLPP 

SISLWDPSGTIMENFTVDATSKMAYLSIPGTA 

KVGTWAYNLQAKANPETLTITVTSRAANSSV 

PPITVNAKMNKDVNSFPSPMIVYAEILQGYVP 

VLGANVTAFIESQNGHTEVLELLDNGAGADS 

FKNDGVYSRYFTAYTENGRYSLKVRAHQGA 

NTARLKLRPPLNRAAYIPGWWNGEIEANPP 

RPEIDEDTQTTLEDFSRTASGGAFWSQVPSL 

PLPDQYPPSQITDLDATVHEDKIILTWTAPGD 

NFDVOCVQRYDRISASILDLRDSFDDALQVN 

TTDLSPKEANSKESFAFKPENISEENATHIFIAI 

KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPT 

rirlrlrl rumiiUNou VjNlo 1 i. VL5> VlGo WTv 

NFELSTTf 


994


2344 


A 


8385 


231 


644 


INSSPRTGRUHQHLNLHTBRDSRSQRAVLKIP 
RQNPGIFYWIELPSRSHSASHGSRQRQVSCQG 
TQDEILKMRNTFAELKNSLE/VLSSRMDQAEE 
RIGTQAGVQWRDHGSLQPQPPEFKQCFHLSL 
PSSWDYRACLS 


995 


2345 


A 


8390 


194 


3421 


awrksswpprgtrrgeksuqdKsgqkNkr 

DFLSMKQSPALAPEERCRRAGSPKPVLRADD 

NNMGNGCSQKLATANLLRFLLLVLIPaCALV 

LLLEILLSYVGTLQKVYFKSNGSEPLVTOGEI 

QGSDVILTNnYNQSTWSTAHPDQHVPAWT 

TDASLPGDQSHRNTSACMNITHSQCQXCLPYH 

ATLTPIXSVVRNMEMEiaajaT=TYLHRLSCY 

QraMU^GCTXAFPECinXiDDSHGLLPCRSFCE 

AAKEGCESVLGMVNYSWPDFLRCSQFRNQT 

ESSNVSRICFSPQQENGKQLLCGRGENFLCAS 

GICIPGKLQCNGYNDCDDWSDEAHCNCSH^IL 

FHCHTGKCLNYSLVCDGYDDCGDLSDEQNC 

DCNPTTEHRCGDGRCIAMBWVCDGDHDCVD 

KSDEVNCSCHSQGLVECRNGQCIPSTFQCDG 

DEDCKDGSDEENCSVIQTSCQEGDQRCLYNP 

CLDSCGGSSLCDPNNSLNNCSQCEPITLELCM 

NLPYNSTSYPNYFGHRTQKEASISWESSLFPA 

LVQTNCYKYLMFFSCTILVPKCDVNTGEHIPP 

CRALCHISKERCESVLGIVGLQWPEDTDCSQ 

FPEENSDNQTCLMPDBYVEECSPSHFKCRSGQ 

CVLASRRCaXJQADCDDDSDEENCGCKERDL 

WECPSNKQCLKHTVICDGFPDCPCYMDEKN 

CSFCQDDELECANHACVSRDLWCDGEADCS 

DSSDEWDCVTLSINVNSSSFLMVHRAATEHH 

VCADGWQEILSQLACKQMGLGEPSVTKUQE 

QEKEPRWLTLHSNWESLNGTTLHELLVNGQS 

CESRSKISLLCTKQDCGRRPAARMNKRILGGR 

TSRPGRWPWQCSLQSEPSGHICGCVLIAKKW 

VLTVAHCFEGRENAAVWKWLGINNLDHPS 

VFMQTRFVKTIILHPRYSRAArVDYDISrVELSE 

DISETGYVRPVCXPNPEQWLEFDrYCYITGW 

GHMGNKMPFKLQEGEVRnSLEHCQSYFDMK 

TnTRMICAGYESGTVDSCMGDSGGPLVCEK 

PGGRWTLFGLTSWGSVCFSKVLGPGVYSNVS 

YFVEWKRQIYIQTFLLN 


996 


2346 


A 


8392 


199 


3085 


KVILSSEMSKTNKSKSGSRSSRSRSASRSRSRS 

FSKSRSRSRSLSRSRKRRLSSRSR8RSYSPAHN 

RERNHPRVYQNRDFRGHNRGYRRPYYFRGR 

NRGFYPWGQYNRGGYGNYRSNWQNYRQAY 

SPRRGRSRSRSPKRRSPSPRSRSHSRNSDKSSS 

DRSRRSSSSRSSSNHSRVESSKKKSAKEKKSSS 

KDSRPSQAAGDNQGDEVKEQTFSGGTSQDTK 

ASESSKPWPDATYGTGSASRASAVSELSPRER 

SPALKSPLQSVWRRRSPRPSPVPKPSPPLSST 

SQMGSTLPSGAGYQSGTHQGQFDHGSGSLSP 

SKKSPVGKSPPSTOSTYOSSQKEESAASQOAA 

YTKRYLEEQKTENGKDKEQKQTNTDKEKKE 

KGSFSDTGLGDGKMKSDSFAPKTDSEKPFRG 

SQSPKRYKLRDDFEKKMADFHKEEMDDQDK 

DKAKGRKESEFDDEPKFMSKVIGANKNQEBE 

KSGKWEGLVYAPPGKEKQRKTEELEEESFPE 

RSKKEDRGKRSEGGHRGFVPEKNFRVTAYK 

AVQEKSSSPPPRKTSESRDKLQAKGDFPTGKS 

SFSITREAQVNVRMDSFDEDLARPSGLLAQER 

KLCRDLVHSNKKEQEFRSIFQHIQSAQSQRSP 

SELFAQfflVnVHHVKEHHFGSSGMTLHERFT 

KYLKRGTEQEAAKNKKSPEIHRRIDISPSTFRK 

HGLAHDEMKSPREPOYKAEGKYKDDPVDLR 

LDIERRKKHKERDLKRGKSRESVDSRDSSHSR 

ERSAEK.TEKTHKGSKKQKKHRRARDRSRSSS 

SSSQSSHSYKAEEYTEETEEREESTTGFDKSRL 






WO 01/57188



PCT/US01/03800



GTKDFVGPSERGGGRARGTFQFRARGRGWG 

RGNYSGNNNNNSNNDFQKRNREEEWDPEYT 

PKSKKYYLHDDREGEGSDKWVSRGRGRGAF 

PRGRGRFMERKSSTSPKWAHDKFSGEEGEIE 

DDESGTENREEKDNIQPTTE 


997 


2347 


A 


8398 


202 


532 


CPALGGRQDLQGTRLLWAHDSGVGGQKAKS 
KQEhnLESLEATGREEEGGQGPPVmCGVLLA 
LLMAGLALQPGTALLCYSCKAQVSNEDCLQ 
VENCTQLGEQCWTARIREWGDDSRQA


998


2348 


A 


8400 


697 


301 


NPPSACIPOSCDSCSGRGRDLAFDSVWSTNN 

MSDPRRPNKVLRYKPPPSECNPALDDPTPDY 

MNIXGMIFSMCGLMLKLKWCAWVAVYCSn 

SFANSRSSEDTKQMMSSFMLSISAWMSYLQ 

NPQPMTPPW 


999 


2349 


A 


8401 


93 


1126


ASASHITSGHLRCFPGSEGVGTMARCFSLVLL 

LTSIWTTRLLVQGSLRAEELSIQVSCRIMGITL 

VSKKANQQLNFIEAKEACRLXGLSLAGKDQ 

VETALKASFETCSYGWVGDGFWISRISPNPK 

CGKNGVGVLIWKVPVSRQFAAYCYNSSDTW 

TOSOTEIITTKDPIFNTQTATQTTEFTVSDSTyS 

VASPYSTIPAPTTTPPAPASTSIPRRKKLICVTE 

VFMETSTMSTETEPFVENKAAFKNEAAGFGG 

VPTALLVLALLFFGAAAGLGFCYVKRYVKAF 

PFINKNQQKEMIETKVVKEEKANDSNPNEES 

KKTDKNPEESKSPSKTTMRCLEAEV 


1000 


2350 


A 


8406 


2 


777 


KERCQFWKPMLSTVGSFLQDLQNEDKGIKT 
AAIFTAIX31«JMISASTLMDILLMNDFKLVINK1 
AYDVQCPKREKPSNEHTAEMEHMKSLVHRL 
FTBUHLEESQKKREHHLLEKIDHLKEQLQPLE 
QVKAGIBAHSEAKTSGLLWAGLALLSIQGGA 
LAWLTWWVYSWDIMEPVTYFITFANSMVFF 
AYFIVTRQDYTYSAVKSRQFLQFFHKKSKQQ 
HFDVQQYNKLKEDLAKAKESLKQARHSLCL 
QMQVEELNEKN 


1001 


2351


A 


8410 


1400 


264 


VOFWERPLRSSRIVFRRSLRRWEMLARAARG 

TGALLLRGSLLASGRAPRRASSGLPRNTWLF 

VPQQEAWWERMGRFHRILEPGLNIUPVLDR 

IRYVQSLKEIVINVPEQSAVTLDNVTLQIDGV 

LYLRIMDPYKASYGVEDPEYAVTQLAQTTM 

RSELGKLSLDKVFRERESLNASrVDAINQAAD 

CWGIRCLRYEIKDIHVPPRVKESMQMQVEAE 

RRKRATVLESEGTRESAINVAEGKKQAQILAS 

EABKAEQINQAAGEASAVLAKAKAKAEAIRI 

LAAALTQHNGDAAASLTVAEQYVSAFSKLA 

KDSNnLLPSNPGDVTSMVAQAMOVYGALT 

KAPVPGTPDSLSSGSSRDVQGTDASLDEHJDR 

A/KMS 


1002 


2352


A 


8421 


134 


941 


NRENLLESRMMDPCSVGVQLRTTNECMKTY 

YTRHTGFKTLQELSSNDMLLLQLRTGMTLSG 

NNTICFHHVKlYIDRFEDLQKSCXaDPFNIHKKL 

AKKNLHVIDLDDATFLSAKFGRQLVPGWKLC 

PKCTQUNGSVDVDTEDRQKRKPESDGRTAK 

AUISLQFTNPGRQTEFAPETGKREKRRLTKN 

ATAGSDRQVIPAKSKVYDSQGLLIFSGMDLC 

DCLDEDCLGCFYACPACGSTKCXJAECRCCRK 

WLYEQIEIEGOEIIHNKHAG 


1003 


2353 


A 


8427 


3 


1416 


TEWGLSGSCPGCSPLEPGSRGRGAAAWRILR 

CRRLPEPSPFLTQPNLAQSQPPAPVPVTDPSVT 

MHPAVFLSLPDUlCSIXlXVTWVriTVTrErr 

SLDTENIDEILNNADVALVNFYADWCRFSQM 

LHPIFEEASDVIKEEFPNENQWFARVDCDQH 

SDIAQRYWSKYPTLKlJHRNGMlvflylKREYRQQ 

RSVKALADYIRQQKSDPIQEIRDLAEnTLDRS 

KRhffiGYraQKDSDNYRVFERVANILHDDCAF 

LSAFGDVSKPERYSGDNIIYKPPGHSAPDMVY 

LGA^^^^1FDVTYNW1QDKCVPLVREITFBNGE 

ELTEEGLPFLILFHMKEDTESLEIFQNBVARQL 

ISEKGTtNFLHADCDKFRHPLLfflQKTPADCP 

VIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDL 

HSGKLHREFHHGPDPTDTAPGEQAQDVASSP 

PESSFQKLAPSEYRYIXLRDRDEL 


1004 


2354


A 


8432 


910 


387 


GLSRKLRAGFLPGFCRVSPCGSWWETLVKM 

ACAAARSPADQDRHCIYPAYLNNKKTIAEGR 

RIPISKAVH^PTATEIODVCSAVGLNVFr FKN 

KMYSREWNRDVQYRGRVRVQLKQEDGSLC 

LVQFPSRKSVMLVAAEMIPKUCTRTXJKTGGA 

DQSLQQGEOSKKGKGKKKK 


1005


2355 


A 


8453 


90 


530 


QSHETKMQSGTHWRVLGLCLLSVGVWGQD 

GNEEMGGrTQTPYKVSISGTTVILTCPQYPGSE 

BLWQHNDKNIGGDEDDKNIGSDEDHLSLKEF 

SELEQSGYYVCYPRGSKPEDANFYLYLRARG 

NPGLQNRYHRLFREDHSKGHSQ 


1006 


2356 


A 


8458 


3 


307 


AVQRIRHEMNIFRLTGDLSHLAAIVILLLKIW 
KTRSCAGISGKSQLLFALVFTTRYLDLFrSFlS 
LYNTSMKVWAIHRm^ira.QCTGLWTLNLC 
QLCEFN 


1007 


2357 


A 


8459 


43 


553 


GAGAGGDWAAMDKLKKVLSGQDTEDRSGL
SEWEASSLSWSTRIKGFIACFAIGILCSLLGT 

vllwvprkguilfavfytfgniasigstiflm 

GPVKQUaUvlFEFrRLIATIMVLLCFALTLCSA 

fwwhnkglalifcilqslaltwyslsfipfar 
davkkcfavcla 


1008 


2358 


A 


8462 


487 


150 


aqdirsvhslgqkstfvkhfrtlshlhglpdp 
pphwppqersppshpqklpshrpqipqlsnsgps 
dprwgcvgpsmftstclpgaveasttkaslp 
kcpvdsslpepeacfl 


1009 


2359 


A 


8465 


134 


954 


ETRVKTSLELLRTQIJEPTGTVGNnMTSQPVP 

nettivlpsnvinfsqaekpeptnqgqdslkkh 

lhaeikvigtiqilcgmmvlslgiilasasfspn 

ftqvtstllnsaypfigpfffnsgslsiatekrl 

tkllvhsslvgsilsalsalvafiilsvkqatl 

npaslqceldknniptrsyvsyfyhdslyttd 

CyTAKASIj\GTLSIJ«lUCIliEFCLAVLTAVL 
RWKQAYSDFPGSVLFLPHSYIGNSGMSSKlVfr 
HDCGYEELLTS 


1010 


2360 


A 


8468 


2 


473 


kyryrrpypvmrkicqvopaglafilnispva 

hrvalchlagcqeqaAwyhtlqilfflvsay 

ffscpvpekyfpgscdivghghqifhaflsicr 

lsqleahidyqgrqeiflqrhgplsvhmacl 

sfffi^csaataaixjuikvkarltkkds 


1011 


2361 


A 


8478 


5 


409 


telsqlekahppadmgrrkskrkpppkkkmt 
gtletqitx^pfcjniekscdvkmdrarntgvi 
sctvcljeefqtpitcilgnlgffqrvgrglesg 
pcssgplcalvqgqsrpeeqvppsdfcgvrrc 

RAGFQCQ 


1012 


2362 


A 


8481 


2810 


1652 


RTSTQKWQSVFNDSQEHLERFYCNPENDRM 
RMKYGGQEFWADLNAMNVYETreFDQLRR 
LSTPPSSNVNSmnVWKFFCRDHFGWREYPE
SVIRUEEANSRGLKEVRFMMWNNHYIUINS 
FFRREDCRRPLFRSCFTLLPYLOTLOCtVPTOAP 

PPLEATSSSQnCPDGVTSANFYPETWVYMHP 

SQDnQVPVSAEDKSYRIIYNLFHKTVPEFKYR 

DLQILRVQNQFLWEKYKRKKEYMNRKMFGR 

DRIINERHLFHGTSQDVVDGICKHNFDPRVCG 

KHATMFGQGSYFAKKASYSHNFSKKSSKGV 

VTSDLYDSCVDNFFEPQIFVIFNDDQSYFYFVI 
QYEEVSNTVSI 


1013 


2363 


A 


8488


2 


517 


lENCRTRLRQAWHEVCGNKMAAPIPQGFSCL 

PIYQPKFKTEKEFMQHARKAGLVIPPEKSDRS 
MLACTAGIFDAYVPPEGDARISSLSKEGLIER 
TERMKKTMASQVSnUUKDYDANFKKDFPE 
KAKDIFIEGSPLY 


1014 


2364 


A 


8501


363 


17 


YmTGY\nncnYAQIAlYTYYIRTAYVYICILY 
AQLAfYTYVLYTHSLOHMySIRTAYVYlCnY 
AQIMYTYVFyTHRLCIHMYSIRTOYVYICaLY 
AQLMYTYVFrmSYMSDE 


1015


2365


A 


8504 


3 


2190 


NSSEHFSQAPQRLSFYSWrYOSARLFRTRVPPD 

AVLLRWLLQVSRESGAACTDAEITVHFRSQA 

PPVINPLGTSFPDDTAVQPSFQVGVPLSTIPRS 

NASVNVSHPAPGDWFVAAHLPPSSQKIELKG 

LAPTCAYVFQPELLVTRWEISIMEPDVPLPQ 

TLLSHPSYLKVFVPDYTRELLLELRDCVSNGS 

LGCPVRLTVGPVTLPSNFQKVLTCTGAPWPC 

RLLLPSPPWDRWLQVTAESLVGPLGTVAFSA 

VAALTACRPRSVTIQPLLQSSQNQSFNASSGL 

LSPSPDHQDLGRSGRVDRSPFCLTNYPVTRED 

MDWSVHFQPLDRVSVRVCSDTPSVMRLRL 

KIXSMDSGGSLTISLRANKTHvIRNETVVVACV 

NAASPFLGFNTSLNCTTAFFQGYPLSLSAWSR 

RANLIIPYPETDNWYLSLQLMCPENAEDCEQ 

AVVHVETIXYLVPCLNDCGPYGQCLLLRRHS 

AATLLLTLSNLMFLAPUVSVRRFFLVEASVY 

AYTMFFSTFYHACDQPGEAVLOLSYDTLQY 

CDFLGSGAAIWVmcavIAELKTVLKyVLFLL 

GTLVIAMSLQLDRRGMWNMLGPCLFAFVIM 

ASMWAYRCGHRRQCYPTSWQRWAFYLLPG 

VSMASVGIAIYTSMMTSDNYYYTHSIWHILL 

AGSAALLLPPPDQPAEPWACSQKFPCHYQIC 

KNDREELYAVT 


1016 


2366 


A 


8511


1 


453 


KWYPSOPVRIPORFYYKLPAOHRRCRMAPAK 

KGGEKKKGRSAlNEWTREynNIHKRlHGVG 

FKKRAPRALKEXRKFAMKEMGTPDVRIDTRL 

NKAVWAKGIRNVFYRIRVRLSRKRNEDEDSP 

NKLYTLVTYWVTTFKNLQTVNVDEN 


1017 


2367 


A 


8513


54 


1196 


LERTPASADMAWTKYQLFLAOLMLVTQSINT 

LSAKWADNFMAEGCGGSKEHSFQHPFLQAV 

GMFLGEFSCXAAFYLLRCRAAGQSDSSVDPQ 

QPFNPLLFLPPALCDNfTGTSLMYVALNMTSA 

SSFQMLRGAVnFTGLFSVAFLGRRLVLSQWL 

GILATIAGLWVGLADlXSKHDSQHKLSEVrr 

GDLUIMAQnVAIQMVLEEKFVYKHNVHPLR 

AVOTEGLFGFVILSLLLVmYYIPAGSFSGNP 

RGTLEDALDAFCQVGQQPUAVALLGNISSIA 

FFMTAGISVTKELSATTRMVLDSLRTVVIWAL 

SLALGWEAFHALQILGFLILUGTALYNGLHR 
PLLGRLSRGRPLAEESEQERLLGGTRTPINDA 
S 


1018 


2368 


A 


8518


324 


694 


SPFWTEKRRMEia»LFPLVPLHWFGFGYTALV 

VSGOIVGYVKTGSVPSLAAGLLFGSLAGLGA 

YQLYQDPRNVWOFLAATSVTFVGVMGMRS 

yYYGKFMPVGUAGASLLMAAKVGVRNILM 

TSD 


1019 


2369 


A 


8526


2 


1787 


VSAAAYNMEPPDAPAQARGAPRLLLLAVLL 

AAHPDAQAEVRLSVPPLVEVMRGKSVILDCT 

PTGTHDHYMLEWFLTDRSGAIU»KLASAEMQ 

GSELQVTMHDTRGRSPPYQLDSQGRLVLAEA 

QVGDERDYVCWRAGAAGTAEAAARLNVF 

AKPEATEVSPNKGTLSVMEDSAQEIATSNSRN 

GNPAPKTTWYRNGQRLEVPVEMNreGYMTS 

RTVREASGLLSLTSTLYLRLRKDDRDASFHC 

AAHYSLPEGRHGRLDSPTFHLTLHYFTEHVQ 

FWVGSPSTPAGWVREGDTVQLLCRGDGSPSP 

EYTLFRLQDEQEEVLNVNLEGNLTLEGVTRG 

QSGTYGCRVEDYDAADDVQLSKTLELRVAY 

LDPLELSEGKVLSLPLNSRAWNCSVHGLPTP 

ALRWTKDSTPLGDGPMLSLSSITFDSNGTYVC 

EASLPTVPVLSRTQNFTLLVQGSPELKTAEIEP 

KADGSWREGDEVTLICSARGHPDPKLSWSQL 

GGSPAEPEPGRQGWVSSSLTLKVTSALSRDGI 

SCEASNPHGNKRHVFHFGTVSPQTSQAGVAV 

MAVAVSVGLLLLWAVFYCVRRKGGPCCRQ 

RREKGAP 


1020 


2370 


A 


8530 


2 


1200 


PRVRLLRPSRSRSCRGLLSTRAPGPSPFRSLHS 

SPLLPHAMKSPFYRCQNTTSVEKGNSAVMGG 

Yli^STGLLGNLLALGIXARSGLGWCBRRPLR 

PLPSVFYMLVCGLTVTDLLGKCLLSPWLAA 

YAQNRSLRVLAPALDNSLCQAFAFFMSFFGL 

SSTLQLLAMALECWLSLGHPFFYRRHITUtLG 

ALVAPWSAFSLAFCALPFMGFGKFVQYCTC 

TWCFIQMVHEEGSLSVLGYSVLYSSLMALLV 

LATVL<>II^AMRNLYAMHRRLQRHPRSCTR 

DCAEPRADGREASPQPLEELX)HLLLLAIAfrV 

LFTMCSLPVTYRAYYGAFKDVKEKNRTSEEA 

EDLRALRFLSVISIVDPWIFIIFRSPVFRJFFHKI 

FIRPLRYRSRCSNSTNMESSL 


1021 


2371 


A 


8536


1 


237 


RRGEIDMATEGDVELELEl'Kl'SGPERPPEKPR 
KHDSGAADLERVTDYAEEKEIQSSNLETAMS 
VIGDRRSREQKAKQER 


1022 


2372 


A 


8537 


94 


541 


RKERRRRRRRMEAVVFVFSLLDCCAUFLSV 
YFinXSDLECDYINARSCCSKLNKWVlPELIG 
HTIVTVLLLMSUIWFIFUJllJPVATWNIYRYI 
MVPSGNMGVFDPTEIHNRGQLKSHMKEAMI 
KLQFHL1X3TTI4YLYSMILALIND 


1023 


2373 


A 


8540 


26 


431 


KMMKCPQALLAIFWLLLSWVSSEDKWQSPL 

SLVVHEGDTVIXNCSYEVTNFRSLLWYKQEK 

KAPTFLFMLTSSGIEKKSGRLSSILDKKELSSIL 

NITATQTGDSArYLCAVEAQCSLVTCSLYSNS 

TAEALQL 


1024 


2374 


A 


8544 


1731 


743 


OVRLRYSPIAWMVGEAGRDLRRRRAVAVT 

AEKMAVLAPLIALVYSVPRLSRWLAQPYYLL 

SAIiSAAFLLVRKLPPLCHGLPTQREDGNPCI) 

FDWREVEEJvlFLSArVMMKNRRSrrVEQHIGN 

lFMFSBrVAKriLFTRLDIRMGIX,YrrLCIVFLM 

TCKPPLYMGPEYIKYFNDKTIDEELERDKRVT 

WIVEFFANWSNDCQSFAPIYADLSLKYNCTG 

LNFGKVDVGRYTDVSTRYKVSTSPLTKQLPT 

LILFQGGKEAMRRPQIDKKGRAVSWTFSEEN 

VIREFNLNELYQRAKKLSKAODNIPEEQPVAS 

TPTTVSDGENKKDK 


1025 


2375 


A 


8546 


2194 


1707 


TVSFHKTMASLKCSTWCVICLEKPKYRCPA 
CRVPYCSWCFRKHKEQCNPETRPVEKKIRS 
ALPTKTVKPVENKDDDDSIADFLNSDEEEDR 
VSLQNLKNLOESATLRSLLLNPHLRQLMVNL 
DQGEDKAKLMRAYMQEPLFVEFADCC1,GIV 
EPSQNEES 


1026 


2376 


A 


8547 


1078 


594 


VGMELPAVNLKVILLGHWLLTTWGCIVFSGS 

YAWANFmALGVWAVAQRDSIDAISMFLGG 

LLATBFLDFVmSIFYPRVSLTDTGRFGVGMAIL 

SLLLKPLSCCFVYHMYRERGGELLVHTGFLG 

SSQDRSAYQTIDSAEAPADPFAVPEGRSQDAR 

GY 


1027 


2377 


A 


8557 


1 


340 


DFLGPASPQEEGGSESSTMFELETAMGMIIDV 
FSRYSGSEGSTQTLTKGELKVLMEKELPGFLQ 
SQKDKDAVDKLLKDLDANODAQVDFSEFIVF 
VAATTSACHKYFEKAGLK 


1028 


2378 


A 


8569 


20 


963 


KMAATLGPLGSWQQWRRCLSARDGSRRLLL 

LLLLGSGQGPQQVGAGQTFEYLKREHSLSKP 

YQGEAI'RPCFLRDWELQVHFKIHGQGKKNL 

HODQLAIWYTKDRMQPGPVFGNMDKFVGLO 

VFVDTYPNEEKQQERVFPYISAMVNNGSLSY 

DHERDGRPTELGGCTAIVRNLHYDTFLVIRY 

VKIOILmiMDIDGKHEWRDCIEVPGVRLPRG 

YYFGTSSTTGDLSDNHDVISLKLFELTVERTPE 

EEKLHRDVFLPSVDNMKLPE\rrAPLPPLSGL 

ALFLIVFFSLVFSVFArVlGIILYNKWQEQSRK 

RFY 


1029 


2379 


A 


8572 


1 


578 


AAAASHRSRARSRPRRVSSGPAPRRAQSSAG 

RVASGUJSAPLCTMARALCRLPRRGLWLLLA 

HHLFMTTACXJBANYGALLRELCLTQFQVDM 

EAVGETLWCDWGRTIRSYRELADCTWHMAE 

KLOCFWPNAEVDKFFLAVHGRYFRSCPISGR 

AVMJPPGSILYPFIVVPITVrLLVTALVVWQS 

KRTEGIV 


1030 


2380 


A 


8574 


1352 


372 


DSSTVKGGSESRHLCLIPDLKGKARTREASSG 

SRTCGRRTSLCTSAKSSWTYRSGRLSWQSIKG 

THLTTTQALRQPUIRAPLLPGQLCWSPRPLEK 

NKAMQRPLLU>LLX1LQP?AFLQPGGSTGSGP 

SYLYGVTQPKHLSASMGGSVEIPFSFYYPWEL 

AIVPNVRISWRRGHFHGQSFYSTRPPSIHKDY 

VNRLFLNWTEGQESGFUUSNLRKEDQSVYF 

atVEUDTRRSGRQQLQSKGTKLTrrQAVTTT 

TTWRPSSrrnAGLRVTESKGHSESWHLSLDT 

AIRVALAVAVLKTVILGLLCLLLLWWRRRKG 

SRAPSSDF 


1031


2381


A


8580 


905 


340 


RRTAGIYPCFPKPGRTRHALCSWLLLLTGQL 

AFDDFQESCANIMWQKYAGSRRSMPLGARIL 

FHOVFYAGGFAIVYYLIQKFHSRALYYKLAV 

EQLQSHPEAQBALOPPLNIHYLKUDRENFVDI 

VDAKLKIPVSGSKSEGIXYVHSSRGGPFQRW 

HLDEVFLELKDGQQIPVFKLSGENGDEVKKE 


1032 


2382 


A 


8593 


2558 


961 


RRRPRLLPGAEPCEPRVGPRRADMGCSAKAR 
WAAGALGVAGLLCAVLGAVMIVMVPSLnCQ 

QVLKNVRIDPSSLSFNMWKEIPIPFYLSVYFFD 

VMNPSEnjCGEKPQYRERGFYVYREFRHKSNI 

TFNNNDTVSFLEYRTFQFQPSKSHGSESDYIV 

MPNILVLGAAVMMENKPMmCLIMTLAFrTL 

GERAFMNRTVOEIMWGYKDPLVNLINKYFP 

OMFPFKDKFGLFAELNNSDSCHJTGFTGVQNl 

SRIHLVDKWN(H.SKVDFWHSDQCNM1NGTS 

GQMWPPFMTPESSLEFYSPBACRSMKLMYKE 

SGVFEGZPTYlUrVAPKTLFANGSIYPPNEGFCP 

CLESGIQNVSTCRFSAPLFLSHPHFLNADPVL 

AEAVTGLHPNQEAHSLFLDIHPVTGIPMNCSV 

KLQLSLYMKSVAGIGQTGKIEPWLPLLWFA 

ESGAMEGETLHTFYTQLVIMPKVMHYAQYV 

LLALGCVLLLVPVICQIRSQEKCYLFWSSSKK 

GSKDKEAIQAYSESUvTTSAPKGSVLQEAKL 


1033 


2383 


A 


8595 


595 


767 


AHLPDTLLLPPHSPTVFIPKSFQCSQKACFSRS 
FCLLLSLVSSSLVSLSLCPPLTQA 


1034 


2384 


A 


8597 


640 


164 


VTTSCnPFAFGLGVRASERLAElDMPYLLKYQ 

PMMQTIGQKYCMDPAVUGVLSRKSPGDKIL 

VNMGDRTSMVQDPGSQAPTSWISESQVFQTT 

EVLTTRnELQRRFPTWTPDQYLRGGLCAYSG 

OAGYVRSSQDLSCDFCm)VLARAKYLKRHO 

F 


1035


2385


A 


8603 


936 


204 


AMASTLEYSPSPLRRLVGPAAGFSRAARADL 

SWDPMAEFTGLWGPFTCVSRVLSHHCFSTTG 

SLSAIQKMTRVRWDNSALGNSPYHRAPRpi 

HVYKKNGVGKVGDQILLAIKGQKKKALrVG 

HCMPGPRMTPRFDSNNWLIEDNGNPVGTRI 

KTPIFTSLRKREGEYSKVLAIAQNFV 


1036 


2386 


A 


8606 


1 


562


PTRAHSFDLCXISPCRRRLLGREEAGEEPTSPV 

TQYLQPRSPEECKMFACAKLACTPSLIRAOSR 

VAYRPISASVLSRPEASRTGEGSTVFNGAQNG 

VSQLIQREFQTSAISRDIDTAAKFIGAGAATVG 

VAGSGAGIGTVFGSLnGYARNPSLKQQLFSY 

AILGFALSEAMGLFCLMVAFLILFAM 


1037 


2387 


A 


8615 


2 


2364 


SPGPSLPESAESLDGSQEDKPRGSCAEKrFrDT 

GMVAHINNSRLKAKGVGQHDNAQNFGNQSF 

EHJIAACLRKGELFEDPLFPAEPSSLGFKDLG 

PNSKNVQNISWQRPKDIlNNPLFIMDaiSPTDI 

CQGILGDCWLLAAIGSLTTCPKLLYRWPRG 

QSFKKNYAGIFHFQIWQFGQWVNWVDDRL 

PIKNDKLVFVHSTERSEFWSALLEKAYAKLS 

GSYEALSGGSTMEGLEDFTGGVAQSFQLQRP 

PQNLLRLLRKAYERSSLMGCSIEVTSDSELES 

MTDKMLVRGHAYSVTQLQDVHYRGKMETU 

RVKNPWGRIEWNGAWSDSAREWEEVASDIQ 

MQXJLHKIEDGEFWMSYQDFLNNFTIXEICNL 

TPDTLSGDYKSYWHTTFYEGSWRTGSSAGGC 

RNHPGTFWTNPQFKISLPEGDDPEDDAEGNV 

WCTCLVALMQKNWRPIARQQGAQLQTIGFV 

LYAVPKEFQNIQDVHLKKEFFTKYQDHGFSEI 

FTNSREVSSQLRLPPGEYmPSTFEPHRDADFL 

LRVFTEKHSESWELDEVNYAEQLQBEKVSED 

DMDQDFLHLFKTVAGEGKEIGVYELQRLLNR 

MAIKFKSFKTKGFGLDACRCMINLMDKDGSG 

KIXJIXEFKILWKKLKKWWDDIERECDQDHSGT 

LNSYEMRLVIEKAGIKLNNKVMQVLVARYA 

DDDLITOFDSFISCFIJRIJnMFTEELTMDPKNT 

GmCLSLEQVLGEGWEGICRIAPACTSTPPPPS 




SDVPGPASCPRLFPPWDLXPVSrrVAADDHVGl 
HAL 


1038 


2388


A 


8621 


3 


1494 


RSRMARAPLGVLLLLGLLGRGVGKNEELRLY 

HHLFNNYDPGSRPVREPEDTVnSLKVTLTNL 

ISLNEKEETLTTSVWIGIDWQDYRLNYSKDDF 

GGIBTLRVPSELVWLreiVLENNlDGQFGVAY 

DANVLVYEGGSVTWLPPAIYRSVCAVEVTYF 

PFDWQNCSLIFRSQTYNAEEVEFrFAVDNDG 

KTINKJDIDTEAYTENGEWAIDFCPOVIRRHH 

GOATDGPGETDVIYSLlIRRKPLFYVINnvpr'V 

LISGLVLLAYFLPAQAGGQKCTVSmVLLAQT 

VFLFLIAQKIPETSLSVPULGRFUFVMWATU 

VMNCVm.NVSQRTPTTHAMSPRLRHVLLEL 

LPRIXGSPPPPBAPRAASPPRJRASSVGLLLRAB 

ELDLKKPRSELVFEGQRHRQGTWTAAFCQSL 

OAAAPEVRCCVDAVNFVAESTRDQEATGEE 

VSDWVRMGNALDNICFWAALVLFSVGSSLIF 

LGAYFNRVPDLPYAPaqP 


1039 


2389 


A 


8636 


1 


900 


PGRERPGGGGARRRPQHLPALLPSERPDCATL 

NSSSGGSGRPTGPQISVYSGIPDRQTVQVIQQ 

ALHRQPSTAAQYLQQMYAAQQQHLMLQTA 

ALQQQHLSSAQLQSLAAVQQASLVSNRQGST 

SGSNVSAQAPAQSSSINLAASPAAAQIXNRA 

QSVNSAAASGIAQQAVLLGNTSSPALTASQA 

QMYXRAQMUFTPTATVATVQPELGTGSPAR 

PPTPAQVQNLTLRTQQTPAAAASGPTPTQPVL 

PSLALKPTPGGSQPLPTPA 


1040 


2390 


A 


8643 


98 


1388 


ASQLAFGGKLTSTPSRDFQGCGRGAVTCCSF 

HEHRHQSGRCLSTGMAPNLKGRPRKKKPCPQ 

RRDSFSGVKDSNNNSDGKAVAKVKCEARSA 

LTKPKNNHNCKKVSNEEKPKYAIGEECRADE 

QAFLVALYKYMKERKTPIERIFYLGFKQINLW 

TMFOAAOKLGGYETITARROWXHIYDFT GO 

NPGSTSAATCTOKHYERULPYERFIKGEEDKP 

UTKHRKQENSSQENENKTKVSGTKRIKHEIP 

KSKKEKENAPKPQDAAEVSSEQEKEQBTUSQ 

KSIPEPLPAADMKKKIEGYQEFSAKPLASRVD 

PEKDNETDQGSNSEKVAEEAGEKOPTPKLPSA 

PLAPEKDSALVPOASKQPLTSPSALVDSKQES 

KLCCFTESPESEPQEASFPRLPHHTGHRWQTR 

MRRRMTNCPPWQITLPTAP 


1041 


2391 


A 


8646 


113 


1492 


LLQEMCTKTlPVLWGCFLLWNLYVSSSQnYP 

GIKARITQRALDYGVQACHvaCMIEQMLKEKK 

LPDLSGSESLEFLKVDYVNYNFSNIKISAFSFP 

NTSLAFVPGVGIKALTNHGTANISTDWGFESP 

LFVLYNSFAEPMEKPILKNLNE\flLCPnASEVK 

ALNANLSTLEVLTKIDNYTLLDYSLISSPEITE 

NYLDLNLKGVFYPLENLTOPPFSPVPFVLPER 

SNSMLYIGIAEYFFKSASFAHFTAGVFNVTLS 

TEEISNHFVQNSQGLGNVLSRIAEIYILSQPFM 

VRIMATEPPDNUJPGNFTLDIPASIMMLTQPK 

NSTVETrVSMDFVASTSVGLmGQRLVCSLS 

LNRFRIALPESNRSNIEVLRFENILSSILHFGVL 

PIANAKLQQGFPLPNPHKFLFVNSDIEVLEGF 

LLISTOLKYETSSKQQPSFHVWEOLNIJSRQW 

ROCSAP 


1042 


2392 


A 


8672 


538 


170 


ARBIARTRESKAAVSQDNVPALQPGKKKKLR 
IXKJKKKKFKFFRLPKEFKKQLMYSPSNFKKM 
TSLAGNTVQCaj«CLKYVIYSAQYPAYGNnT 
LDMITSTDHVLEQDFWianrFYSVKERQI 


1043 


2393 


A 


8688 


359 


17 


GLKTRAPAIKIFQREVUJPAKQDMQRRCPRI 
GLMTSLLKPDCRRWKDYKRWKSGGFTGESC 
HHADTLGDRGGLQGDHSHLLQWQKRILRTE 
GEPSPKYISKNIFPICSYITGFL 


1044 


2394 


A 


8718 


292 


1490 


GTVKTSVATPITAGHSCSSGGVLQVKSPATQS 

GFKFTSKMH)FNMESDSFEDFWKGEDLSNYS 

YSSTLPPFIXDAAPCEPESLJEINKYFWnYAL 

VFLLSLLONSLVMLVILYSRVORSVTDVYLL 

NLALADLLFALTLPIWAASKVNGWIFGTFLC 

KWSLLKEVNFYSGILLLAaSAnDRYLArVHA 

TRTLTQKRYLVKFICLSrWGLSLJLLALPVLLFR 

RTVYSSNVSPACYHJMGNNTANWRMLLRIL 

PQSFGFIVPLUMLFCYGFTLRTLFKAHMGQK 

HRA\lRVIFAVVIJFLUnVLPY>nLVUj\DTLM 

RTQVIQBrCERRNHIDRALDATElLGILHSCLN 

PUYAnGQKTiaiGIXKILAIHGLISKDSLPKDS 

RPSFVGSSSGHTSTTL 


1045


2395 


A 


8724 


254 


3184 


REIANLAITVANRRGAQGGKMHTCCPPVTLEQ 

DLHRKMHSWMLQTLAFAVTSLVLSCAETIDY 

YOEICDNACPCEEKDGILTVSCENRGIISLSEIS 

PPRFPIYHLLLSGNLLNRLYPNEFVNYTGASIL 

HLGSNVIQDIETGAFHGLRGLRRLHLNNNKL 

ELLRDDTFLGLENLEYLQVDYNYISVIEPNAF 

GKIJIIXQVULNDNLLSSLPNNLFRFVPLTHL 

DLRGNRLKLLPYVGLLQHMDKWELQLEEN 

PWNCSCELISLKDWLDSISYSALVGDWCETP 

FRLHGRDLDEVSKQELCPRRLISDYEMRPQTP 

LSTTGYLHnPASVNSVATSSSAVYKPPLKPP 

KGTRQPNKPRVRPTSRQPSKDLGYSNYGPSIA 

YQTKSPVPLECPTACSCNLQISDLGLNVNCQE 

RKIESIAELQPKPYNPKKMYLTENYIAWRRT 

DLLEATGLDLLHLGNNRISMIQDRAFGDLTN 

LRRLYLNGNRIERLSPELFYGLQSLQYLFLQY 

NIJRmQSQTFDPVlTgLQUJFLNNNLLQAMPS 

GVFSGLTUJRLNLRSNHFTSLPVSGVUXJLKS 

LIQIDUIDNFWDCTCDIVGMKLWVEQLICVG 

VLVDEVICXAPKKFAETDMRSIKSELLCPDYS 

DVWSTFTPSSIQVPARTSAVTPAVRLNSTGA 

PASLOAGGQASSVPLSVLILSLLLVFIMSVFVA 

AGLFVLVMKRRKKNQSDHTSTNNSDVSSFN 

MQYSVYGGGGGTGGHPHAHVHHRGPALPK 

VKTPAGHVYEYIPHPLGHMCKNPrYRSREGN 

SVEDYKDLHELKVTYSSNHHLQQQQQPPPPP 

QQPQQQPPPQLQLQPGEEERRESHHLRSPAYS 

VSTIEPREDLLSPVQDADRFYRGILEPDKHCST 

TPAGNSLPEYPKFPCSPAAYTFSPNYDLRRPH 

QYLHPGAGDSRLREPVLYSPPSAVFVEPNRNE 

YLBLKAKLNVEPDYLEVLEKQTTFSQF 


1046


2396

A 


8736 


28 


452 


SPSAAGOLAWVSLALGSGSRGRDHSGSOVGT 
AMAUAijVKKAAUy VKoltUrKDYliMalHrW 
OPVANWGLPIAAINDMKKSPEnSGRMTFALC 
CYSLTFMRFAYKVQPRNWLLFACHATNBVA 
QUQGGRLIKHEMTKTASA 


1047 


2397 


A 


8741 


673 


924 


ALPGTPQQTVTLNTOGKVKSFTSPHSNPNLPP 
AKFFTSLQSLNWSSHLPPSPATESVGKRGNAK 
PPTTKLLHSSPLWNFFAQQL 


1048 


2398 


A 


8747 


3 


5054 


PEVTKPSLSQPTAASPIGSSPSPPVNGGNNAKR 

VAVPNGQPPSAARYMPREVPPRFRCQQDHK

VLLKRGQPPPPSC3«ILLOGOAGPFPCTAPGAN 

PNNAQVTGALLQSESGTAPDSUXWAAASNY 

ANSTWGSGASSNNGTSPNPnnWDKVIVDGS 

DMEEWPCIASKDTESSSENTTDNNSASNPGSE 

KSnPOSTTSNKGKGSQCQSASSGNBCNLGV 

WKSDPKAKSVQSSNSTTENNNGLGNWRNVS 

GQDRIGPGSGFSNFNPNSNPSAWPALVQEGTS 

RKGALETDNSNSSAQVSTVGQTSREQQSKME 

NAGVNFWSGREQAQIHNTDGPKNGNTNSL 

NLSSPNPMENKGMPFGMGLGNTSRSTDAPSQ 

STGDRKTGSVGSWGAARGPSGTOTVSGQSNS 

GNNGNNGKEREDSWKGASVQKSTGSKNDS 

WDNNNRSTGGSWNFGPQDSNDNKWGEGNK 

MTSGVSQGEWKQPTGSDELKIGEWSGPNQPN 

SSTGAWDNQKGHPLLENQGNAQAPCWGRSS 

SSTGSEVEGQSTGSNHKAGSSDSHNSGRRSy 

RPTHPDCQAVLQTLLSRroLDPRYLSNTGWG 

QTQIKQDTVWDIEEVPRPEGKSDKGTEGWES 

AATQTKNSGGWGDAPSQSNQMKSGWGELS 

ASTEWKDFKNTGGWNDYKNNNSSNWGGGR 

PDEKTPSSWNENPSKDQGWGGGRQPNQGWS 

SGKNGWGEEVDQTKNSNWESSASKPVSGWG 

EGGQNEIGTWGNGGNASLASKGGWEDCKRa 

PAWNETGRQPNSWNKQHQQQQPPQQPPPPQ 

PEASGSWGGPPPPPPGNVRPSNSSWSSGPQPA 

TPKDEH>SGWEEPSPQSISRKMDIDDGTSAWG 

DPNSYNYKNVNLWDKNSQGGPAPREPNLPTP 

MTSKSASDSKSMQDGWGBSDGPVTGARHPS 

WEEEEDGGVWNTTGSQGSASSHNSASWGQG 

GKKQMKCSLKGGNNDSWMNPLAKQFSNMG 

U^QTEDNPSSKMDLSVGSLSDKKFDVDKRA 

MNLGDFNDIMRKDRSGFRPPNSKDMGTTDS 

GPYFEKGGSHGLFGNSTAQSRGLimPVQPLN 

SSPSLRAQVPPQFISPQVSASMLKQFPNSGLSP 

GLFNVGPQLSPQQIAMLSQLPQIPQFQLACQL 

LLQQQQQQQLLQNQRKISQAVRQQQEQQLA 

RMVSALQQQQQQQQRQPCaoCHSPSHPVGPK 

PHUJNMVPNALNVGLPDLQTKGPIPGYGSGF 

SSGGMDYGMVGGKEAGTESRFKQWTSMME 

GLPSVATQEANMHKNGAIVAPGKTRGGSPY 

NQFDnPGDlLGGHTCB'AGDSWLPAKSPPTNK 

IGSKSSNASWPPEFQPGVPWKGIQNIDPESDP 

YVTPGSVLGGTATSPIVDTDHQLLRDNTTGS 

NSSLNTSLPSPGAWPYSASDNSFTNVHSTSAK 

FPDYKSTWSPDPIGHNPTHLSNKMWKNHISS 

RNTTPLPRPPPGLTNPKPSSPWSSTAPRSVRG 

WGTQDSRLASASTWSDGGSVRPSYWLVLHN 

LTPQIDGSTLRTICMQHGPLLTFHLNLTQGTA 

LIRYSTKQEAAKAQTALHMCVLGNTTILAEF 

ATDDEVSRELAQAQPPTPAATPSAPAAGWQS 

LETGQNQSDPVGPALNLFGGSTGLGQWSSSA 

GGSSGADLAGASLWOPPNYSSSLWOVPTVED 

PHRMGSPAPIiPGDLLGGOSDSI 


1049 


2399 


A 


8748 


200 


1387 


VPWKRQDEQLSLQVETLYU3SPAVIHLLSPTF 

LPPSSLPPFLQIVDSSSSAC11.DSFFPFLAPWDS 

PQDCGFKDHQPLTU}ALTVELARWTLMLLLS 

TAMYGAHAPLLALCHVDGRVPFRPSSAVLLT 

ELTKLLLCAFSLLVGWQAWPQGPPPWRQAA 

PFALSALLYGANNNLVIYLQRYMDPSTYQVL 

SNLKIGSTAVLYCLCLRHRLSVRQGLALLLL 

MAAGACYAAGGLQVPGNTLPSPPPAAAASP 

NIPLHIIPLGLLLLILYCLISGLSSVYTELLMKR 

QRLPLALQNLFLYTFGVLLNLCILHAGGGSOP 

GLLEGFSGWAALWLSQALNGLLMSAVMKH 

OSSJTRLFWSCSLWNAVLSAVLLRLQLTAA 

FFLATLUGLAMRLYYGSR 


1050 


2400 


A 


8758 


3 


1660 


WVSSMGFEELLEQVGGFGPFQLRNVALLALP 

RVLU»LHFLLPIFLAAVPAHRCALPGAPANFS 

HQDVWLEAHLPREPDGTLSSCLRFAYPQALP 

NTTLGEERQSRGELEDEPATVPCSQGWEYDH 

SEFSSTIATESQWDLVCEQKGLNRAASTFFFA 

GVLVGAVAFGYLSDRFGRRRLLLVAYVSTLV 

LGLASAASVSYVMFArrRTLTGSALAGFTnV 

MPLELEWLDVEHRTVAGVLSSTFWTGGVML 

LALVGYLIRDWRWUXAVTLPCAPGILSLWW 

VPESARWLLTQGHVKEAHRYLLHCARLNGR 

PVCEDSFSQEAVSKVAAGERWRRPSYLDLF 

RTPRLRHtSLCCWVWFGVNFSYYGLSLDVS 

GLGLNVYQTQLLFGAVELPSKLLVYLSVRYA 

GRRLTQAGTLLGTALAFGTRLLVSSDMKSWS 

TVLAVMOKAFSEAAFTTAYLFrSELYPTVLR 

QTGMGLTALVORLGGSLAPLAAIXDGVWLS 

LPKLTYGGIAIXAAGTAUUPETRQAQLFEn 

QDVERKSAPTSLQEEEMPMKQVQN 


1051 


2401 


A 


8759 


515 


1625 


EIRTPVAVSSAPSGDSEGDEEBTTQDEVSSHTS 

EEDGGWKVEKELENTEQPVGGNEWEHEV 

TGNLNSDPLLELCQCPLCQLDCGSREQLIAHV 

YQHTAAWSAKSYMCPVCGRALSSPOSLGR 

HLLIHSEDQRSNCAVCGARFTSHATFNSEKLP 

EVLNMESLPTVHNEGPSSAEGKDIAFSPPVYP 

AGUXVCNNCAAYRKIXEAQIPSVRKWALRR 

QNEPLEVRLQRLERERTAKKSRRDNETPEERE 

VRRMRDREAKRLQRMQETDEQRARRLQRDR 

EAMRLKRANETPEKRQARUREREAKRLKRR 

I3KMDMMLRAQFGQDPSAMAALAAEMNFF 

QLPVSGVELDSQIXGKMAFEEQNSSSLH 


1052 


2402 


A 


8763 


110^ 


70 


RHGHGGRDRRGGGRVARPGGLGRYPGRGAA 

ASLVFVPTRRRSGPSGTASVAAMAYHSGYGA 

HGSKHRARAAPDPPPLFDDTSGGYSSQPGGY 

PATGADVAFSVNHLLODPMANVAMAYGSSl 

ASHGKDMVHKELHRFVSVSKLKYFFAVDTA 

YVAKKLGLLVFPYTHQNWEVQYSRDAPLPP 

RQDLNAPDLYiPTMAFrrYVLLAavIALGIQK 

RFSPBVLGLCASTALVWWMEVLALLLGLYL 

ATVRSDLSTFHLLAYSGYKYVGMILSVLTGL 

U^SDGYYVALAWTSSALMYFrVRSLRTAAL 

GPDSMGGPVPRQRLQLYLTLGAAAFQPLllY 

WLmiLVR 


1053 


2403 


A 


8768 


2 


712 


RPPRVWYPELRELSAAAPRWSHRTAPGIMVF 

YFTSSSVNSSAYTTYMGKDKYENEDLIKHOW 

PH)IWFHVDKLSSAHVYLRLHKGENIEDIPKE 

VLMDCAHLVKANSIQGCKMNNVNVVYTPW 

SMJCKTAD^a?VGQIGFHRQKDVKrVTVEKK 

VNEILNRLEKTKVERFPDLAAEKECRDREER 

NEKKAQIQEMKKREKEEMKKKBEMDELRSY 

SSLMKVENMSSNQDGNDSDEFM 


1054 


2404 


A 


8769 


344 


327 


REATTLACRNSCWVFSRCSLGACKPTVCSMP
SLSRQGSQTL(XRLAEYCMESVDSQRUXS 


1055


2405 


A 


8770 


430 


1104 


QQESPAAGAARMNCKEGTOSSCGCRGNDEK 
KMLKCVWGDGAVGKTCLLMSYANDAFPEE 
YVPTVFDHYAVTVTVGGKQHLLGLYDTAGQ 
EDYNQLRPLSYPNTOVFLICFSWNPASYHNV 
QEEWVPELKDCMPHVPYVUGTQIDLRDDPK 
TLARLLYMKEKPLTYEHGVKLAKAIGAQCYL 
ECSALTQKGIJKAVFDEAILTIFHPKKKKKRCS 
EGHSCCSn 


1056


2406 


A 


8773 


261 


332 


NPRIQLSGNSCCAGSCRVWLSEQ 


1057 


2407 


A 


8778 


3 


477 


PAGIRHEQAR.GADRMGKCRGLRTARKLRSH 
RRDQKWHDKQYKKAHLGTALKANPFGGAS 
HAKGIVLEKVGVEAKQPNSAIRKCVRVQLIK 
NGKKTTAFVPNDGCLNFIEENDEVLVAGFGR 
KGHAVGDIPOVRFKWKVANVSLLALYKGK 
KERPRS 


1058


2408 


A 


8808 


171 


881 


PGLSQEPSGSMETWIVAIGVLAHFLASFAAL 

VLVCRQRYCRPRDLLQRYDSKPIVDLIGAME 

TQSEPSELELDDWITNPHIEAILENEDWIEDA 

SGLMSHCIAlLKlCHTLTEKLVAMrMGSGAK 

MKTSASVSDnWAKRISPRVDDWKSMYPPL 

DPKLLDARTTALLLSVSHLVLVTRNACHLTG 

GLDWIDQSLSAAEEHLEVLREAALASEPDKG 

LPGPEGKLQEQSAI 


1059


2409 


A 


8809 


246 


757 


MRLCJGAIFVLLPHLGPILVWLFTRDHMSGWC 

EGPRMLSWCPFYKVLLLVQTAIYSWGYASY 

LVWKDLGGGLGWPLALPLGLYAVQLTISWT 

VLVLFFTVHNPGLALLHLLLLYGLWSTALI 

WHPINKIAALLLLPYLAWLTVTSALTYHLWR 

DSLCPVHQPQPTEKSD 


1060 


2410 


A 


8810 


304 


381 


PKLSVYPLQSHHCLSEmjSLVCXJLA 


1061 


2411 


A 


8820 


1673 


848 


SCKTENLLEMWWFQQGLSFLPSALVIWTSAA 

FIFSYTTAVTLHHIDPALFYISDTGTVAPEKCLF 

GAMLNIAAVLClATTYVRYKQVHALSPffiNVI 

KLNKAGLVLGILSCLGLSrVANFQKTTLFAA 

HVSGAVLTFGMGSLYMFVQTILSYQMQPKIH 

GKQVFWIRLLLVIWCGVSALSMLTCSSVLHS 

QNFGTDIJEQKlJIWNPEDKGYVLHMnTAAE 

WSMSFSFFGFFLTYIRDFQKISLRVEANLHGL 

TLYDTAPCPINNERTRLLSRDI 


1062 


2412 


A 


8824 


1 


763 


GGAPPASVPARESPVSGAQGSSRTRGHKRAA 

GARAPQLCSSWQRRSAPAMSRGLQLLLLSCA 

YSLAPATREVKVACSEDVDLPCTAPWDPQVP 

YTVSWVKLLEGtffiERMETPQEDHLRGQHYH 

QKGQNGSFDAPNERPYSLKIRNTTSCNSGTyR 

CTLQDPIKJQRNLSGKVILRVTGCPAQRJKEET 

FKKYRAEIVLLLALVIFyLTLIIFTCKFARLQSI 

FPDFSKAGMERAFLPVTSPNKHLGLVTPHKT 

ELV 


1063 


2413 


A


8826 


147 


627 


CETSTSSAGHAPCRHAAQGPPAEPTGLRLCSE 

HQRLHAWPPGPRRPSLWPPKNGKWHSGKRT 

AGGRPQRRPSRRQSQRPSAWSGSPRMHSPGQ 

KCSLMCPHRSQDSLSTAIFQRSPGANTGRALH 

CVLSKEMKSVQRSLGLSRIHLQSKRKIIHFVL 

TR 


1064 


2414 


A 


8835 


2982 


1869 


LKDTLXSQMTQEASDEAEDMKEAMNRMIDE

LNKQVSELSQLYKEAQAELBDYRKRKSLEDV 

TAEYIHKAEHEKLMQLTNVSRAKAH)ALSE 

MKSQYSKVLNELTQLKQLVDAQKENSVSrre 

HLQVnrUlTAAKEMEEKISNLKEHLASKEVE 

VAKLEKQLLBEKAAMTDAMVPRSSYEKLQS 

SLESEVSVLASKLKESVKEKEKVHSEWQIRS 

EVSQVKREKENIQTLLKSKEQEVNELLQKFQ 

QAQEELAEMKRYSESSSKLEEDKDKKINEMS 

KEVTKLKEALNSLSQLSySTSSSKRQSQQLEA 

LQQQVKQLQNQLAECKKQHQENOSVYRMHL 

LYAVQOQMDEDVQKVLKQILTMCKNQSQK 

K 


1065


2415


A 


8841 


3 


663 


AAATAASLSPRGCRLRTPSSDVGPSRAPPPSA 

APLPTGRAQMSPSGRLCLLTIVGLILPTRGQTL 

KDTTSSSSADATIMDIQVPTRAPDAVYTELQP 

TSPTPTWPADETPQPQTQTQQLEGTDGPLVT 

DPBnDCSTKAAHPTDDTTTLSERPSPSTDVQT 

DPQTLKPSGFHEDDPFFYDEHTLRKRGLLVA 

AVLFTTGIIILTSGKaiQLSRLCRNHCR


1066


2416 


A 


8853


3806 


2204 


FVQEQEGGC3BAGAGRGAQTYPGEAGERWFG 

RRRRRGRVVSRKKMSLKSERRGIHVDQSDIi 

CKKGCGYYGNPAWQGFCSKCWREEYHKAR 

QKQIQEDWELAERLQREEEEAFASSQSSQGA 

QSLTFSKFEEKKTNEKTRKVTTVKKFFSASSR 

VGSKKEIQEAKAPSPSINRQTSIETDRVSKEFIE 

FLKITHKTGQEIYKQTKLFLEGMHYKRDLSIE 

EQSECAQDFYHNVAERMQTROKVPPERVEKI 

MDQIEKYIMTRLYKYVFCPETTDDEKKDLAI 

QKRIRALRWVTPQMLCVPVNEDIPEVSDMW 

KATronEMDSKRVPRDKLACrriCCSKHIFNAI 

KTTKNEPASADDFLPTLmVLKGNPPRLQSNI 

QYlTRFCNPSRLMrGEDGYYFTNLCCAVAFlE 

KLDAQSLNLSQEDFDRYMSGQTSPRKQEAES 

WSPDACLGVKQMYKNLDLLSQLNERQERIM 

NEAKKLEKDUDWTDGIAREVQDrVEKYPLEI 

KPPKQPLAAIDSENVENDKLPPPLQPQVYAG 


1067 


2417 


A 


8855 


1372 


1513 


SNMREVGCX3WLVPVIPAFWEAEVGGSLEARS 
tRQAWATKQDPISKKK 


1068 


2418 


A 


8856 


1530 


1583 


PCRPGMECNSMISVHCNL 


1069 


2419 


A 


8857 


1530 


1583 


PCRPGMECNSMISVHCNL 


1070 


2420 


A 


8866 


293 


1675 


PYPQGGYPQGPYPQEGYPQGPYPQGGYPQGP 

YPQSPFPPNPYGQPQVFPGQDPDSPQHGNYQ 

EEOPPSYYDNQDFPATNWDDKSIRQAFIRKVF 

LVLTLQLSVTLSTVSVFTFVAEVKGFVRENV 

WTYYVSYAVFFISLIVLSCCGDFRRKHPWNL 

VALSVLTASLSYMVGlvIIASFYNTEAVIMAVG 

ITTAVCFTWIFSMQTRYDrrSCMGVLLVSM 

VVUTFAnxaPIRNRILEIVYASUJALLFraFLA 

VDTQLLLGNKQLSLSPEEYVFAALNLYTOIINI 

FLYILTnGRAKE'PSSSSLCPLRWHGWPGPCP 

WHGSASCTSPLSCPQAQPREKDASLQPSai4Y 

TADTSIWTRCGHSMAPLVLPPPPRGTKATFPC 

HIXSTHCCMSPVCQPTPGTGGSTRSRGEGLSQ 

EVRVHVFPPVPAPQPGVEHPSPPPHPPGVLPS 

GDMRSGGUPVLSPE 


1071


2421


A


oooo 


2 


358 


ARGNILYHLPRLCRKLNLRWFSASTLYDVQH 
DDKMGSNTFFKRNDCRYVMISCKADMAYDN 
VRHPFM*SI\KLIMEETYLNIIKAVYDRPTASn 
LNOEKLKVFPVRSGT»QGCSVWP 


1072 


2422 


A 


8870 


33 


658


MESVLSKYEDQmFTDYLEEYPDTDELVWIL 
GKQHLLKTEKSKLLSDISARLWFrYRRKFSPI 
GGTGPSSDAGWGCMLRCGQMMLAQALICRH 
LGRDWSWEKQKEQPKEYQRILQCFLDRKDC 

CYSIHQMAQMGVGEGKSIGEWVLGPNTV\AQ " 

GV*KNLA\LFDEWVNSLGLVYVSM\DNPSGSIA 

RFPKIOLCRVLPLVSADTAGLTGP 


1073 


2423 


A 


8879 


146 


412 


DFSV*GDVDIEVTCPICLQLLTliPLSLNCGLRL 
*QVCITA*IKESVnSGG*SSSPVCHTTFQPANL 
RTSRYLPT*SIKSLGroEPQEG 


1074 


2424 


A 


8884 


67 


435 


HLQGRSXRTLQLTGJa^EKNCEVSERIRRSGPW 
KHSFGDYICHTFQGDCWADRSPLHEAAAHG 
RLLALKTLIAQGVNVNLWTL/DRVSSLHEACX 
•GPVACAKPYWKMVPRHGOTVTGPPLLMV 


1075 


2425


A 


8896 


1294 


248 


RSGDKNGLTHQLGGI-SQGSRNQSYRSRSRSR 

SRERPSAPRGIPFASASSSVYYGSYSRPYGSDK 

PWPSUX>KEREESLRQKRLSERERIGELGAPE 

TSEEEKKKKSSRSKERSKKRRKKKSSKRKHK 

KYSEDSDSDSDSETDSSDEDNKRRAKKAKKK 

EKKJCKHRSKKYKXKRSKKSRKESSDSSSKES 

QEEFLENPWKDRTKAEEPSDLIGPEAPKTLTS 

QDDKPLNYGHALLPGEGAAMAEYVKAGKRl 

PRRGEIGLTR*RNCHHLNAQVM**VVSRHRR 

MEAVRTAKREraSTVLMRREPLHPFNPRRET 

KERB 


1076 


2426 


A 


8899 


146 


789 


GRSTEAEKEPAFDERTGKGRRLPRAGEFHG»E 

»APGPGPRSFQVSRKMPEE\PPGARKHPFSGKS 

FYLDLPAGKNLOFLTGAIOOLGGVIRttFI SK P 

VSYTVSSRREVKAESSGKSHRGCPSPSPSEVR 

VETSAMVDPKGSHPRPSRKPVDSVPLSROKE 

LLQKAIRNOK* ♦CTVOOLSHCRLYXGEKTTAK 

RSQREHVQQQSQEHGKWPDUCGPR 


1077 


2427 


A 


8901 


352


3 


AKIGAVKYIQELWRKKQSDVMHFLLRVRCW 
QYPALHRAGTEWQLSAUIRAPRSTQPDKAC 
RLGYKAKQGY1IYRICVRROOWKC3>VPKAVT 
\YGKPVHHGVN*LKFAQSLQSVAEEQ 


1078 


2428 


A 


8903 


336 


781 


ACa>AENREVEEMAAGQAPHAGPGAGPGQPA 
PALPFAATPGSRGQALCRGGRRRQHLHGPLH 
RP*QAAPALHAGCX3LAPHPPT 


1079 


2429 


A 


8912 


121 


376


NLrWKLCV'l'ERRLVlLDNYDLASE/YEANKYl 
CNRnQFKPGQDKYFTLGLPTGSTPL'CYPKLI 
EYNKNGHLSFKYVKTFSMDEY 


1080 


2430 


A 


8920 


381 


1788 


SSESPSDPGRMAMTWTVFSLWPLTVFMGHIG 

GHSLFSCCTnXJlMCQDIJYNTTFMPNLLNHY 

IX3QTAALAMEPFHPMVNUX:SRDFRPFLCAL 

YAPICMEYGRVTLPCRRLCQRAYSECSKLME 

MFGVPWPEDMECSRFPDCDEPYPRLVDLNLA 

GEPTEGAPVAVQRDYGFWCPRELiaDPDLGY 

SELHVRDCSPPCPNMYFRRfflLSFARYFIGUS 

nCLSATLFTFVTFLIDVTRFRYPERPIKCYAV 

WHMMVSLIFFMGFLLEDRVACNAVSIPAQYKA 

STVTQGSHNKACIMLFMILYFFTMAGSVWW 

VlLTrrWFUW^WKWGSEAIEKKALLFHASA 

WGlPGTLniLLAMNKIEGDNISGVafVGLYD 

VDALRYFVLAPLCLYWVGVSLLLAGnSLNR 

VRIEIPL*KENQDKLVKFMIRIGVFSILYLVPIX 

WIGCYFYEQAYRGIWETTWIQERC 


1081 


2431 


A 


8922 


36 


420 


EERTKMSTGPDyKATVGDlSSDGNLNVAQEE 
CSRKGIVDEFFPLLSN*CIWTQPQGYPQSSYG 
TLANFVF\CSVRHGLALILQLCa»lFSrYTQQMN 
LSIAJPAMVNNTAPPSQPNASTCRPST 


1082 


2432 


A 


8923 


355 


1079 


?FGTPSSTMAVVKNKCa.MKGGKKOVKKKVV 

GPFSKKI)QYDVKAPAMFNIRNTOK/TLVART 
QGTQIASDGLKGLLFEVSLADLQNDEVAFRK 

ITITT l I'WI iVOriVKTn TKTirv/^XifTM 'VfTWir^ClL jn i 

EKWSTMIEAH\T)\aCTTTXjYFFHU?WGFTKK 

QTTTOIiCEVVNKLIPDNIGKDTEKV/CPIYPLH 
DWIRKYKMLENPOFERWIELROGGSSS 


1083 


2433 


A 


8948 


28 


385 


LTWFQPMPSCPAMSEETLQSKLAAAKKKLP 
WGAVQGSRAMSDLLXLLLDLTLLLLLMLLGF 
AOioUviLAuVAVSAGSPPI/RYKFHVEPYGET 
GWLLT/ESCSISPKLCSIAVH*DNPAWF 


1084 


2434 


A 








HV J J'lN tDl UlNSENNKCW*GY*E\VGLIHHW 
WGGKRVQPFWKRVWQKRTLNLRV 


1085 


2435 


A 


8956 


16 


413 


HMGQLGYnQCWWECKRLISF\WKTI*QSPAK 

♦TTYTSYDTAIPIS/GI/YPKRMSSKCHQETCAR 

MFILAPFTATKGKQLTCPLVEERIDYXMWYS 

HKYYIICVKWflL*VTrrH\TWVNLNILMFEIILW 

YSHKYY 


1086 


2436 


A 


8962 


868 


1026 


H*KILQVGRAQRAHXSRL*SQLLRRLRHESHL 
NtOjAKUCSEARLHRL. 1 FAwTT 


1087 


2437 


A 


8985 


58


330 


LHVKHLOHFQLVFSEVlCHaLMPVS*ELQRL 
♦ERSVCAFHVaQTYVCXQVYACMCVYYICM 
r V Yo V YGCGLCrCVCMDVYICVCVQEFL 


1088 


2438 


A 


8989 


394 


404 


N*K\raJtmjVRIQSIFF/KRNQK/INSHELKU3 
KKn-DMMSNA*STKKHDKLD/LIKFKT/LCSA 
KYTVKIUKIHPTDLEKMLKNHLSDKD*YS/GV 
YKDLSKiNRRKTE/S*ATCKWVKDLSRYFIKE 
VISMENKHKKIFSTS 


1089 


2439 


A 


8991 


60 


329 


MALTPESPSSFPGLAATGSSVPEPPGGPNATL 
NSSWDSPTEPSSLEDLEATGTIGTLLSDMGW 
o Vlu^NAr ILBVNiiKYMKAVGIM'IHL 


1090 


2440 


A 


8996 


2 


351 


SNmTLT*MKKYDNTFCW*GCGQIO/T/UyC 
WQESKFIQAFWSKIQQYLA*ISIHILFDPAFLFL 
ljOyi'ijljJQbVi'l,JGVl*VSSVlTfNMKMLJin 
LUAALFnVQYWKQSKDHYI 


1091 


2441 


A 


8997 


97 


456 


YPLPVCSYLSGPRGEHWNSLGGKSSCPLPUT 
LVSSRFKISKVIWGDLSVQKTCLINR*OOAO 
AH.ORVOPSLARWAGSRSQHLVPSQ\VCKDS 
FDiam^APIGADFEMERFEVLGIPF 


1092 


2442 


A 


8999


548 


811 


SSnKRHILIFEDDWHQTrCCHHPHHP\F*RCQ 
FHIFYVSVQNSISPSLSVSSSHPDRPDHEVHQH 
RAAHHHQHGQGPLGHGLVARVG 


1093 


2443 


A 


9002 


3 


2745 


ALLGLQQPAQSLILSRSSVMOVRGLQGFVGS 

TCPmCTVVlvn?KELAEHHRSKYIH3CTFnvVD 

AMCCLRYWYTPESWICGGQWRBYFSALRDF 

VKTFTAAGIKLIFFFDGMVEQDKRDEWVKRR 

LKNNREISRIFHYIKSHKEQPGRNMFFIPSGIA 

VFTRFALKTLQQETLCSLQEADYEVASYOLQ 

HNCLGILGEDTDYUYDTCPYFSISELCLESU) 

TVMLCREKLCESLGLCVADLPLLACLLGNDn 

PEGMFESFRYKCLSSYTSVKENFDKKCfNnT A 

VSDMSKVLYLYQGEKKLEEILPL/VTXQSSFL 

*RNGnSFTRT/[NLHGFSKNPKV«*LWTNK*YP 

RVQTPNPOKKFPCVQMLNPGKKFPCVQALNP 

GHCFPCIHI/PEPRQEVPTCSDPEPRQEVPTCTG 

PESRREVPMCSDPEPRQEVPMCTGPEPRQEVP 

MCTGPEARQEVPMCTDSEPRQEVPMCTDSEP 

RQEVPMYTGSEPRQBVPMYTGPESRQENa»MY 

TGPESRQEVLIR7DPESRQE1MCTGHESK0EV 

PICIUPISKQEDSMCTHAEINQKLPVATDFEFK 

LEALMCTNPEDCQEDPTNVGPEVKQJQVTMVS 

DTEILKVARTHHVQAESYLVYNIMSSGEIECS 

NTLEDELDQALPSQAFIYRPIRQRVYSIJXED 

CQDVTSTCl^VKEWVYPGNPLRHPDLVRFL 

QMTIPGGTPSLKlLWLNQEPEIQVRRLDTnULA 

CFNLSSSREELQAVBSPFQAUXXLIYLFVQV 

DTLCLEDLHAFIAQALCLQGKSrSQLVNLQP 

DYINPRAVQLOSLLVRGLTTLVLVNSACGFP 

WKTSDFMPWNVFDGKLFHQKYLQSEKGYA 

VEV1/CRTK*ISAHQIPQPEGSRLQGLHEGEQT 

HHWPSPLGLTPRREVdKTGLQLPQDGLWV 


1094 


2444 


A 


9021 




834 


AREACRAKTDFPGRRFRLWPSCCCRVrVGAE 

T*H\MAEPVSPLKHFVLAKKArrAIFDQLLEFV 

TEGSHFVEATYKNPELDRIATEDDLVEMQGY 

KDKLSDGEVLSRRHMKVAFFGRTSSGKSSVI 

NAMLWDKVLPSGIGHTTNCFLSVEGTDGDKA 

YLMTEGSDEKKSVKTVNQL/VHALHMDKDLK 

AGCLVRVFWPKAKCALLRDDLVLVDGPGTD 

VTTELDSWIDKFCTKSSTREITNSGSDT 


1095 


2445 


A 


9022 


1 


537 


LVLNSRVEDFVPPEGAGRTLPFALRPLAACW 
LLHRRARRSSALCPRPRSWOVSGGEGAGARE 
P*rrSSSCXa,SAA/SHLSIQSPNMAGARRRIRPQ 
LAKEKIEGCMICTSVTPGEPQVFLGKDKAFTF 
DYVFDIDSQQEQIYIQCIEKLIEGCFEGYNATV 
FAYOQTVGAGKTYTMGTGFD 


1096


2446 


A 


9029 


1 


285 


FFFFNVCKSPKVPKPGCKEESTGTLFKNTUSL 
GQHSETPSLKKK\LAGYSGMCL*SQVLRRLRQ 
EDCLSPGGGNCRES*SCPYrPAWrrERDPV 


1097 


2447 


A 


9032 


716 


357 


ARSTGFWGEILWCX3FLKRSLALSPRVKCSGAI 
1^C3<IFRHAGFPPLSCLSLPNRWEYRRPPARP 
GKFELVFLVETGFQC/G*1X5LDLLTSRSACLG 
U»KCWDYRREPAASIIFQrrFFINSK 


1098 


2448 


A 


9038 


230 


652 


KVVVMSCKDINISGSFYRNKLKYLAFLCKRTS 
TNPSQCTYHLWVPSHIFWQTTCGRLPHKTICQ 
G*AALDHLKVFDRIPLPYDKKKQMAVSATLB 
VVRPKP*RKFAYLGHWAQKNa>WKYQAMTA 
TMGEKRKVYYQKICYQKK 


1099 


2449 


A 


9043 


185 


372 


llFYSHgQCMRV/WQGCGDIETLlHCW*E*KII 
HSL/WK/rV*QFLKRLYLHLPHNSVIAFLGISP 
RKIKTCPQNSCTSMLINAIHNDQK\iTKKINI 


1100 


2450 


A 


9045 


763 


584 


RQSLALSPRLJECSGTISAHCRLCPLVFTPLSa. 
SLTSSWDYRRPPPHPANFLYFK*RRGF 


1101 


2451 


A 


9050 


275 


2 


LFFLRKVSNQFLSPSLLPVNFQGFVFAFUULLL 
FLL/FEMESLPViVRVECSOTISAHCjnXaPGSS 
DSPASAS*VAOITDMCRYTQLILFHAS 


1102 


2452 


A 


9053 


449 


1224 


KTSMFWKFDLHSSSHnmUEREDVTLKELM 

DEEDVLQECXAQNRKXIEFLLKAECLEDLVSF 

I\*EEPPQDMDEKIRYKYPNISCELLTSDVSQM 

NDRLGEDESLLMKLYSFLLNDSPLNPLLASFF 

SKVLSnJSRKP'EQIVDFLKKKHDFVDLIIKHIO 

TSAIMDLLLRLLTOEPPQPRQDVLN/WFKVQ 

RNL*HST*NVMDISKYVNLHWGLNKSHSLL» 

IXLQCVLQWLNEEKnQRLVErVHPSQEEDVS 

SLV 


1103 


2453 


A 


9058 


403 


3 


GLHVYDFQVYREHILTLNVKKCSVSFWOLRE 
WLYLQMYEnKSPRFPIIKMTDrnCCW*GaGA 
AGMQI/H/CW\WCVNVGKFWEMS*YYLLKLSI 
ST/PYDPAIPLI,Grn-*ETRVYIHPKTCMRKlLIA 
APFVLAVNC


1104 


2454 


A 


9064 


75 


393 


KWLFSSLMTGRGDIIGHLKWLDCRVNCSSFPI 
KRNRQTHSTESNKLKAOHSFGYN*LIH*NS\V 
KTDCGCGANSKGVVVVMKV\KTAQQKQrrS 
YMQIGTTKNSRAT 


1105 


2455 


A 


9065 


366 


778 


DLLILW«.AFPELKRRNCISRITLAYHLHKIYS 
RSniXaWCSGFYILSL'QYDVFFFNYFFFRDR 
AWPCCPGWSAAWLnVILAHYRRPGLERSCC 
LSLSSSWDHRRVPPCPANF*/YFSMGFTAFPRL 
VLNS»TQG1 


1106 


2456 


A 


9083 


673 


816 


ESGSLIH»WWENKPAQPLWWEI*QHV<)KLPT 
HFPCDPAIPLLGICPED 


1107 


2457 


A 


9086 


580 


18 


KPSSGSFIRAIYIFLSTAHVPALFSVLVRTKLT* 

AFSQSSVLWAHKQQKTSLSLVIR/ERLQIKTA 

VRENFLPIRLAKIUa.DNVKCWQG/SGSNMSL 

I/HCWWEY>WIHIIWNSVTFPRKVEHVYrrYA 

PEISVR'HGGLPTLVHQETHTSVFRGAPSVIP 

ETR\CRPTKESlNKLLmYTMEHYGDENK 


1108 


2458 


A 


9093 


540 


1 


GGNDCSVTPTTEPGRKE1T*KRKF*EKTDRLP 

GA/PPSRTPPTPYPCPHGDRLLPPSRPLPAGPA 

SAFPPAERSRGHRRASL*RARWSAAVPRRSA 

GSASEPVQSRWLRLPVGSDSPPAVPVRVCPAP 

DSRPAAPGSRLPDPGLDSPAPSRTPSSSVD*GG 

QRPPPPS(H)SLSPPGCCRY 


1109 


2459 


A 


9099 


1255 


1425 


HESYHVNPNLCNFVAPTSGAHSIG*KWPSWL 
GAVAHSCNPSTLVGRGGRITRGQELR 


1110 


2460 


A 


9103 


242 


70 


EEQFFFFAVGMFP»VDFLAPASGELWDRLRLT 
CSRPFTRHQSFGLAFLRVCSSLDSLDDSWGP 
SALLSSVL/NQOGKNVLEAREAAKHPTI*RQS 
LLRKQRNKRMAIP 


1111


2461 


A 


9110 


189 


121 


SFLSVRLECNGAIMAHCALPLPG 


1112 


2462 


A 


9113 


100 


910 


RRRGGGSRPRRTPVPAPGPGPSFGMDVRFYP 

AAAGDPASLDFAQCLGYYGYSKFGNNNNYM 

NMAEANNAFFAASEQTFHTPSLGDEEFEIPPIT 

PPPESDPALGMPDVLLPFQALSDPLPSQGSEFT 

PQFPPQSLDLPSmSRNLVEQDGVLHSSGLHM 

DQSHTQVSQYRQDPSLIMRVPSST*PDAARSG 

VMPPAQLTTINQSQLSAQLGLNLGGASMPirr 

SPSPPASKSATPSPSSSINEEDADEANRAIGEK 

RAAPDSGKKPKTPKK 


1113 


2463 


A 


9120 


3452 


3051 


FLRPSFALVPQAGVQWCALSWLQPPSPRFK'F 

SCLSLPSSWDYBHVPPRPANFFV1X.VETGFLH 

VGQAGHEPLTSGDPPASASQSAGTTGVSHQA 

WPSFFIFSRDTVLLCCSGWSRTSGLKQSACLS 

LLKCWDY 


1114 


2464 


A 


9122 


152 


377 


NQLPLQQWTFKIYETGFCSVAQAGVQCRDHS 

SLHP*PPG\SSDPPAPPS*VLGITGQRYHACLn 

YLYVQTVPQRV 


1115


2465


A 


9124 


553 


981 


QRPLLRQQLGSWPTCRSLEQDLASPW**RLPG 

SPRMRRSOT/ATLNLPLSPQGTVRTAVEFQVM 

TQTQSLSFLLGSSASUXXa'SMAPGLDLISVE 

WRLQHKGRGRGDUn^DHHLSVPSSADHPA 

QQPSQFNGRNLYFLPLFR 


1116 


2466 


A 


9135 


48


410 


SASHEPAEHDGGADSLSASQPPRPAOKPAOA 

QHVHVPPWTDVLAGQDRRAPTAODQAPW 

APGGHVPSTRPHDPAEFHADEAAGRGGRGLQ 

PAAPHALPAGLPHGPPAPA/PAEGGGTP*GSA 

GAGGP*GSPAGRACGAAGCRPRPPRPAASSA 

*NSAGS*GLVEGT*PPGAGHGAPSPAVGARLS 
CPARTSVQGGTWTC*APAGRPAGLGGWEAE
RESAPPSCSAGS*DAD*GAEPWGAGSRSWGS 


1117 


2467 


A 


9141 


380 


939 


KSGHWAKECLQPRIPPRPCPICVGPHWKSDCP 
TCPGAVPRAPG11PQGSLTDSFPDLLSLVAED 
*CC*I MA^KA^wnrxPT uruTT 'r\7vmrQwx>/n 

wV'l^lALOdflO W 111 \EfLtYf V ILil VuVJ^O Vr^/oLi 

NTEATHSTLPSFQGPVSLASrrWGIDGQASKP 
LKTP(^WCQLGQYSFMHYFLVIPTCFVPLLG* 
GILTKLSAFLTlPRL<yHLIAALSPSS 


1118 


2468 


A 


9154 


471 


2 


AAGQVWEVTSHLYLCrrSDAAGLRLLPPAES 
ERGEGGHCPAEAPLPPRPQYCLAKHPLLRKLP 
HBKnCLDPYLTQHTKINSKQIKYLS/VRAKTTQ 
LVEGNIGVNLQNTELKQH*INGFLDTTPEAQE 
TXEKTNKLNFDCKVKRQLAEWEKIFQIA 


1119


2469


A 


9155 


2 


3187 


ACPRLARRRRRVRSLRRRRGWLRARWSRGQ 

NNMAARRITQETFDAVLQEKAKRYHMDASG 

EAVSETLQFKAQDLLRAVPRSRAEMYDDVHS 

DGRYSLSGSVAHSRDAGRESU^VFSGPSFR 

SSNPSISDDSYFRKECGRDLEFSHSNSRDQVIO 

HRKLOHFRSQDWKFALRGSWEQDFQHPVSQ 

ESSWSQEYSFGPSAVLGDFGSSRJLIEKECLEK 

ESRDYDVDHPGEADSV/LRGGSQVQARGRAL 

NIVDQEGSLLGKGETQGLLTAKGGVGKLVTL 

RNVSTKKIPTVNRrrPKTQGTNQIQKNTPSPD 

VTLGTNPGTEDIQFPIQKIPLGLDLKNLRLPRR 

KMSFDIDDKSDVFSRFGIEnKWAOFHTIKDDIK 

FSQLFQTLFELETETCAKMLASFKCSLKPEHR 

DFCFFTIKFUaiSALKTPRVDNEFLNMLLDKG 

AVKTKNCFFEIIKPFDKYIMRLQDRLLKSVTP 

UJWWCNAYELSVKMKTLSNPUJLALAIJETTN 

SLCRKSLALLGQTFSLASSFRQEKIL*AVGLQ 

DIAPSPAAFPNFEDSTUFORBYIDHLKAWLVS 

SGCPLQVKKAEPEPMREEEKMIPPTKPEIQAK 

APSSLSDAVPQRADHRWGTIDQLVKRVIEGS 

LSPKERTLLKEDPAYWFLSDENSLEYKYYKL 

KLAEMQRMSENLRGADQKPTSADCAVRAML 

YSRAVRNLKKKLLP\WQRRGLLRAQG\LRG\ 

WKARRAVTTOTQULFLRAPGLKHHGRQAPO 

LSQAKPSLPDRNDAAKDa>PDPVGPSPQDPSL 

WFUHDQNSSAFKFYRKKVFELCPSICFTSSPH 

NLHTGGGDTTGSQESPVDLMEGEAEFEDHP 

PREAELESPBVMPEEEDEDDEDOGEEAPAPG 

GAGKSEGSTPADGLPQEAAEDDLAGAPALSQ 

ASSGTCFPRKRISSKSLKVGMIPAPKRVCUQE 

pkgecppvgtvassrvlgwwavrvrrdrwh 

HFNPKEFCAPLQNVSRHSCFPW 


1120 


2470 


A 


9163 


124 


207 


ppracrpcpracpcppt*kcsqpvswpc 


1121 


2471 


A 


9166 


272 


523 


PMSSLQGCl'YiFKCllFKQlFLLLISNLIAF**EK 

V/CSfflTDSLKnOKGWVGMVTHACNPGTLG 

G*GGWIA»VREFETSLGNM 


1122 


2472 


C 


9170 


442 


236 


MNRRRFLRPADCHSGMRGIENGACSEGESQI 
HCGAGGEGVQLVHWNQPENGCLQFDSTHIT 
FSKRQN* 


1123 


2473 


A 


9171 


10 


423 


MVDRSPLLTSVUFYLAIGAAIFEVLEEPHWKE 
AIOChrmXJKimUCHETCLGQEGLDKlLEVV 
SDAAGQGVAITGNQTTNNWNWPNAMIFAAT 
VrraGYGNVASKirOGRLFCOFYGLFOVPFC 
LTWlNALOKFfG 


1124 


2474 


A 


9173 


3 


374 


GPSPSLLVLLPQEPGGTGTPVRAGAGAGMWL 
WEDQGGLLGPFSFUvlLMLLLBrRNPVNACLL 
TGSLFVLLOVFSFEPVPSCaiALQELKPRDRISA 
IAHRGGRHDPPENTLGAIRA}GS»*WSNKR 


1125 


2475 


A 


9179 


704 


188 


ESSSGLLFQJCFQGIHVQKLTLQARPTLFSWWL 

CSKPPKETGBLENAESGGDGGRRGGKQDNV 

AWWRRM\QKG\DFPWDDEDFPQSGPFGGQA 

LPMOITYLYFRDPORErrWKHFVQYYLAROL 

VDRLBWNKQSVRVIPAPGTSSEVRGEFKAE 

YOIHKFISCKNWFYFFQ 


1126 


2476 


A 


9183 


153 


233 


MEYMAESTDRSPGHDUCCECGVPISPN 


1127 


2477 


A 


9185 


1 


321 


LTGQLGSILLRVFSKSRAGLGARKLKAYRTM 
EYMAESTDRSPGHIL(XECXjVPISPNPAQY\CV 
AOJlSSFHIYHCIPKLFIHPFSKTSSSAFrrPSHY 
LTFFSnS 


1128 


2478 


A 


9186 


183 


847 


VLKELLLQTMDEQSQCailQGPPVPQFQPQKAL 
RPDMGYNn.ANFRIEKKIGRGQ\FSEVYRAAC 
LVLDGVPVALKKVQIFDLMDAKARADCIKEID 
LLKOLNHPNVIKYYASFIFDNFr TJIVr RT AHA 

GDLSRMDCHFKKQKRUPERTVWKYFVQLCS 
ALEHMHSRRVMHRDIKPAh4VFITATGVVKLG 
DLGLGRFFSSKTTAAHSLVGTPYYMSPERIHD 
NG 


1129 


2479 


A 


9190 


1 


370 


GTSWKJPSAAVSESSPNQAAYASGLPCXJVRG 
PPWAGLALLPSPTLMALLRRPTVSSDLDNlDf 
RATTtfCIRWATirRARIEDMRHSATALTRPD 
ATTAQIPKLPVTTVCNRRANPGIPPS\rL 


1130 


2480 


A 


9194 


131 


487 


AYLKRLPVPESITGFARLTVSEWLRLLPFLGV 
LALLOYLAVRPFLPKKKQQKDSLINLKIQKEN 
PKWNEINIEDLCLTKAAYCRCWRSKTFPAC 
DGSHNKHNELTGDNVGPULKKKE 


1131 


2481 


A 


9201 


184 


605 


kelvdeksergramdpvsqlasAgtervlke 
plaflralellpaifafatcggysgglrlsvd 
cvnktesnlsidiafaypfrlhqvrfegvptce 
okerhklaugdssssaeffgtvaofaflysl 
aatgvyiffqnky 


1132 


2482 


A 


9206 


1 


852 


GGGRAGAGSRDMGSTDSKLNFRKAVIQLTTK 

TQPVEATDDAFWDQFWADTATSVQDVFALV 

PAAEIRAVREESPSNLATLCYKAVEKLVQGA 

ESGCHSEKEKQrVLNCSRLLTRVLPYIFEDPD 

WRGFFWSTVPGAGRGGQGEEDDEHARPLAE 

SLLLAIADLLFCTDFTVQSHRRSTVDSAEDVH 

SLDSCEYIWEAGVGFAHSPQPNYIHDMNRME 

LLKLLLTCFSEAMYLPPAPESWQH/RTHWFSS 

FVSSENRHALPLFTSLLNTVCAYDPVEYQIPY 

NHLY 


1133 


2483 


A 


9208 


1165 


1463 


GPRARVQGFSGADIVKEMALGSMYLVLTUV 
AKNOJIGAEPCCGPLKNRVLRPCPLP/VPLPPP 
HPQPSRGI^VGCLPTYKVVYKLLSWPLHSNS 
NVYFTV 


1134 


2484 


A 


9210 


66 


1586 


MAGAGPKRRALSAPVAEEKEEAREKIMAAK 

RADGAAPAGEGEGVTLQ(a>JITLLKGVAVIW 

AIMGSGIFVTPTGVLKEAGSPGLALWWAAC 

GVFSIVGALCYAELGrnSKSGGDYAYMLDV 

YGSLPAFUaWXELLimPSSQYIVALVFATYL 

UCPLFPTCPVPEEAAKLVACLCVLLLTAVNC 

YSVKAATRVQDAFAAAKLLALALmXGFVQI 

GKGDVSNLDPNFSFEGTKLDVGNIVLALYSG 

IJAYGGWNYLNFVTEEMlNFYRNLPLAmSLP 

IVTLVYVLTNLAYFTTLSTEQMLSSEAVAVDF 

GNYHLOVMSWnPVFVOLSCFGSVNOSLFTSS 

RLFFVGSREGHT PSILSMTFTPOT T TPVP^T VPT 

CVMTLPYAFSKDIFSVINFFSFFNWLCVALAn 

GMIWUUnaCPELERPKVNLAIJVKFILACLF 

HAVSFWKTrPWSVASDFmLSGLPVYFFGV 

WWKNKPKWAPPGHLSPRPSCVRSSCMWPQ 


1135


2485


A 


9216 


40 


410 


RDKLPPAYFCRPWCWTALDVGVSPESQEM 
DLVAFEDVAVNFTQEEWSLLDPSQKNLYREV 
MQETTJiNLASIGEKWKDQNIEDQYKNPRNNL 
RSLLGERVDENTEENHCGETSSQIPDDTLNK 


1136 


2486 


A 


9223 


3 


983 


RRKRRSRYRRCSRFPRPGPLAVSMPHAFKPO 
DLVFAKMKGYPHWPARIDDIADGAVKPPPN 
K I Jrlrrru 1 Ht 1 Ar J^ui'KJDLrlTrDK.CKDKYGK 
PNKRKGFNEGLWEIQNNPHASYSAPPPVSSSD 
SEAPEANPADGSDADEDDBG\RGVMAVTAVT 

MSVSKRARKASSDLDQASVSPSEEENSESSSE 
SEKTSDQDFTPBKKAAVRAPIUIGPLGGRKKK 
APSASDSDSKADSDQAKPEPVAMARSASSSSS 
SSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRK 
PKPERPPSSSSSD 


1137 


2487 


A 


9229 


21 


239 


LFPRLECRDPVTVNCTLNLPGSKNAPTTASQV 
GSTWNYRGOLPHPTNFFVKTGFRCSQAGLKL 
RGSREPPAWA 


1138 


2488 


A 


9231 


1664 


2 


TRSVGVNTCEVGWTEPECLGPCEPGTSVNL 

EGIVWHETEEGVLWNVTWRNKTyVGTLLD 

CTKHDWAPPRFCESPTSDLEMRGGRGRGBCR 

ARSAAAAPQSEASFTESRGLQNKNRQOANGK 

GRROSLNASGRRTPPNCAAEDIKASPSSTNKR 

KNKPPMELDLNSSSEDNKPGKRVRTNSRSTP 

TTPQGlCPETmX)QGCSSPVLIDCPHPNCNKK 

YKMNGLRYHQAHAHLDPENKLEFEPDSEDK 

ISDCEEGLSNVALECSEPSTSVSAYDQLKAPA 

NSGKKKGLNNELNNLPVISNMTAALDSCSAA 

DGSLAAEMPBCLEAEGLIDKKNLGDKEKGKK 

ANNCKTDKNVPSKI KSABPIAPAPAPTPPni lA 

IPTATFTTTTTGTIPGLPSLTTTVVOATPK'SPPI 

ia»IQPKPTIMGEPrrVNPALVSLKDKKKKEKR 

KLKDKEGKETGSPKMDAKLGKLEDSKQASK 

DLPOHFLKDHLNKNEGLANGLSESQESRMAS 

KAEADKVYTFTDNAPSPSIGS 


1139 


2489 


A 


9234 


207 


443 


TRRGQPWRRRAAAAGILPGREAAAajPSC/AS 

VTAAVSGLLVGYELGnSGAIXQIKTIJLALSC 

HEQEMGVSSLVIGALL 


1140 


2490 


A 


9238 


248 


328 


MAQGNNYGQTSNGVADESPNMLVYRKV 


1141 


2491 


A 


9242 


2 


535 


FVEAAVXMLGSLVLRRKALAPRLLUILLRSP 

TLRGHGGASGRNVTTG5?LGEPOWr HVATOO 

RPGTSPALFSGRGAATGGRQGGRFDTKCLAA 

ATWGRLPGPEETXPGQDSWNGVPSRAGLGMX 

WPWAAALWHCYSKSPSNKDAALLEAARAQ 

VNMQEVSRNRCAIXHSAAVQEYGYGN 


1142 


2492 


A 


9245 


157 


466 


HlXFWtKVGUTLPEQQIMLFATLLRMAQGCD ' 
FAIX3M51^NrrTKAQAnTCEKIX)KLDFIKIKrC 
CTSMDAIEKTEPLTKWTKAFVSHVSYKRLLF 
GICXEYSRQ 


1143 


2493 


A 


9247 


264 


113 


GLPQQTSnQPPGTPDGARDFTSTIQPPGAPDG 
ARDSTSnRMGPEIPPP 
SEQ ID NO: of nucleotide sequence


SEQ ID NO: of peptide sequence


Method


SEQ ID NO: in USSN 09/496,914


Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence


Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence


Amino acid sequence (A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine, K=Lysine, L=Leucine, M=Methionine, N=Asparagine, P=Proline, Q=Glutamine, R=Arginine, S=Serine, T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop codon, /=possible nucleotide deletion, \=possible nucleotide insertion)
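The relationship between the two predicted nucleotide locations and the listed amino acid sequence can be checked with a short calculation. The following is only an illustrative sketch, not part of the disclosed methods: the function names are hypothetical, and it assumes that the locations are inclusive, that each residue letter and each stop codon (*) occupies one codon, and that the deletion (/) and insertion (\) markers occupy none. The example values are taken from the row with peptide SEQ ID NO: 2418.

```python
def expected_codons(start: int, end: int) -> float:
    """Codons spanned by an inclusive nucleotide range.

    The range may run in either direction (some rows list a beginning
    location larger than the end location), so the absolute
    difference is used.
    """
    return (abs(end - start) + 1) / 3


def codon_count(peptide: str) -> int:
    """Count residue letters and stop codons in a listed sequence.

    Assumption: the '/' and backslash frame-shift markers from the
    legend are annotations and do not occupy a codon.
    """
    return sum(1 for ch in peptide if ch.isalpha() or ch == "*")


def is_consistent(start: int, end: int, peptide: str,
                  tolerance: float = 1.0) -> bool:
    """True if the sequence length roughly matches the nucleotide span."""
    return abs(expected_codons(start, end) - codon_count(peptide)) <= tolerance


# Row with peptide SEQ ID NO: 2418: locations 1530 and 1583.
print(is_consistent(1530, 1583, "PCRPGMECNSMISVHCNL"))  # prints True
```

A row that fails such a check is not necessarily wrong; marked insertions or deletions shift the reading frame, which is why the sketch uses a tolerance rather than an exact match.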


1144 


2494 


A 


9260 


1 


401 


KKVPGRLSEMSFSLNFIlJ»AinTSSPVT\DCX}P 

SLGLAAGIPLLVATALLVALLFTUHRBRSStE 

AMEESDRPCEISEIDDNPKISENPRRSPTHEKN 

™gaqeahiyvktvagseepvhdryrptiem 

ERRR 


1145 


2495 


A 


9264 


175 


411 


metwiyqfrlieigdstvgkscixhrfrqgrf 
pglrspacdptvgvdffsrlleeepgkriklll 
wdtagqerfisit 


1146 


2496 


A 


9277 


592 


814 


Wl'YLEGREGIKSQPKMEPHSVnRLECSGMI 

SAHCSIJ4IJGTSDSPASASR/VAGTTGMRHHA 

WLIFAFLVBTGF 


1147 


2497 


A 


9279 


1255 


2 


frrgrrgeeekeeeeeeeegwvngmenshpp 
hhhhqqpppqpgpsgerrnhhwrsyklmidp 
alkkghhklyrydgqhfslamssnrpvetve 
dprwgiwtknke\lelsvpkfkidefyvdqv 

PPKQVTFAKLNDN1RENFLM)MCKKYGEVEE 

hlhstsvmgniihveldtkgetrmrfyel\lv 

tgrytpqtlpvgeldavspivnetlqlsdalk 

rlkdgglsagcgsgsssvtpnsggtpfsqdta 

ysscrldtpnsyg/qgtpltprlgtpfsqdssy 

ssrqptpsylfsqdpavtfkarrheskjtday 

nrrhehhyvhnspavtavagataafrgssd 

lpfgtvggtggssgppfkaqpqdsatfahtpp 

paqatpapofr 


1148 


2498 


A 


9302 


1026 


6 


lasiqnadtmpgvgllvshfstxvsrqrcpny 
adpqnltdvsiflu^sgdpelqpvlaglfl 

SMCLVTVLGNLLITLATSPnSHT HTPJk/rVFPPWJ 

lslpdv\gftsttvpk\mivdi\qsrsrvisyag 
cltqkslfaifggteevnmllsvmaydrfvai 
oiplyhsaimnpcfcaflvllsffflslldsql 
hswiylqftoknvhsnfvcdpsqllkfacsd 

SnNSIFrYTHKDPERQLVLAGLFLSMCLVTVL 

gnuiiix>vspdshlptpmyfflsnlslpdigft 
sttvpkmivdiqshgrvifyagcltqmslfaif 
ggmeerhapecdgl 


1149 


2499 


A 


9303 


1 


699 


masqekdmowgtihlfrkpqrsffgkllre 

frlvaadrsmgrymlfgvinlictgfllmwc 

sstnsialt\syiyltifdlfsln4tclisywvtl 

rkpspvysfcserlevlavfastvlaqlgalf 

ilkbsaerfleqpeihtgrllvgtfvalcfnlf 

tmlsirnkpfayvseaastswlqehvadlsr 

SLCGnPOLSSIFLPRMNPFVLlDLAGAFALaT 

ymliei 


1150


2500 


A 


9308 


797 


693 


drstsvtragvqwcslgslqprtpgllrsscl 

SLP 


1151


2501 


A 


9309 


205 


406 


vakelpvlwkwskptrmakeppqtqqrag 
sktaappcqwsrmasegpnipctoarhsdkq 
Fucn 


1152


2502 


A 


9314 


913 


504 


kpsplitppawlppsavlnlvntfssfpqveV 
qgplcgprkgrlavnpffols/lpkymdhrr 
ppphr\eiffvflaetofhrasqaopdlpts/s/i 

PPTSA/EPKCWEYRSEPQCU>GCLSFSGILLDL 
GTNVSLRAA 


1153


2503 


A 


9315 


392 


1 


hphrprpgfrsparssrpcpvltsixppfpspsp 
paddlvkagrdrkdpqvr/errlrpnpgrlg 
gprvprparars^chqprltrvcprspppeara 
papaapargrgapkrnrprtdtraprgssar 

PONS 


1154 


2504 


A 


9321 


331 


433 


MPCI/QAQYGTPAPSPGPRDHSASDPLTPEFK 
PT 


1155 


2505 


A 


9324 


180 


275 


MEEPQSDPSVBPPLSQETFSDLWKLLSENNYL 


1156 


2506 


A 


9326 


383 


619 


MISPSRTEGDPLPUPP/EGEGQEVRGFGGGPAK 

cA/VVKilL-KAo V o11jKMK1u*w^SSRPARVPL 

RGPDSHRLREPPPSPP 


1157 


2507 


A 


9327 


152 


292 


YERRGRSQGGGSHPAQAQPGORAIOAGWQS 

UTC/^T /^¥> octroi 

JtlirL.WbOLAjRSGSPLPG 


1158 


2508 


A 


9328 


1 


430 


QELKQGPNPLAPSPSAPSTSAGLGDCNHRVD 

LSKTFSVSSALAMLQERRCLYWLTDSRCFL 

VCMCFLTnQALMVSGYLSSVrmERRYSLKS 

SESGLLVSCFDIQNLWWFVSYFRGRRRRP/ 

RVAAVGGLLDLEGG^II 


1159 


2509 


A 


9334 


108 


383 


KGNQVNGNGNQLKRKHESMCPVSLTQNTVR 
Ll^kffiAGLPQKQAERADELFEAGLVIYVKLDER 
VLNALWSSVGLQWFKESDLSHLRLLEISFR 


1160 


2510 


A 


9338 


2 


430 


FVORPROLSDRLEDLFLAOFRV0ERLRTAAM 
KRYVRILLLGEGAEHVADPVPGGRGVPRGEA 
DHTDQELRffilHKANVERWHDVSQEATIEia 
RTKWIPLV/RWGDHA/EGPVGIKSYLPSGRSM 
EAELPIMSQLTEIETCVEC 


1161 


2511 


A 


9341 


1 


390 


NSRVDDFVAPGLSEAGKLLGLEFPERQRtAA 

AVG/CSPMSGVISMSAPFFLGKIIDAIYTNPTV 

DYSDNLTRLCLGLSGVFLCGAAANAIRVYLM 

QTSRQRWKRLRTSLFSSILGQEVAFSDKAGT 

GELI 


1162


2512


A


9343


84 


837 


QGRFRAFCWQRDFLQPPGMRLSALLALASKV 

TLPPHYRYGMSPPGSVADKRKNPPWIRRRPV 

WEPISDEDWYLFCGDTVEILEGKDAGKQGK 

WQVIRQRNWVWGGLNTHYRYIGKTMDYR 

GTMEPSEAPLLHRQVKLVDPMDRKPTEIEWR 

FTEAGHIVRVSTOSGRIIPKPEFPRADGIVPET 

WIDGPKDTSVEDALERTYVPCLKTLQEEVME 

AMGKBTRVNTRRSIGIEPOAEQLLPNFCPSLE 

G 


1163 


2513 


A 


9346 


967 


616 


DSLALSPRLECSGAISAHCNLTPPGFTPFSCLS 
IJ'SSWAYRCASPHPDNFFVELVESGFHHVGQ 
AGLKLLISSDPPTSA/FPKCWDYRRD\SSAPAT 
FSSYQRNNPDULNDTIMPNIK 


1164 


2514 


A 


9347 


3 


1099 


SSmXMRTVFHSNTSVSSLLHRPGHVTPQLTl 

HGGWRHHRDHTAIDEWDFNPSKFLIYTCLLL 

FSVLLPLRLDOnQWSYWAVFAPIWLWKIXV 

VAUAbVuAGVWARNPRYRTEGEACVEFKA 

MIJAVGIHIXUJ^VLVCDRVERGTHFWLL 

VFMPLFFVSPVSVAACVWOFRHDRSLELEELC 

SVNILQFIFIALKLDRIIHWPWLWFVPLWILM 

SFLCLVVLYYIVWSLLFLRSLDWAEQRRTH 

VlMAIoWJ JlVVrlALirliVLX.VHRIJ^GHrfit'o 

YVSIFVPLWLSU-TXMATITRRKGGNHWWF 

nlluUJtllAiiJ^ilJrKirl UKrrrrrJL lX>ilHOEKA 

LPLQNKDRGSWPASRGSPRLL 


1165 


2515 


A 


9362 


547 


991 


DVSIGPPLLRRPCSGREQTRSLSFPSDPESSFSP 

VPEGVRLADGPGHCKGRVEVKHQNQWYTV 

CQTGWSLRAAKWCRQLRCGRAVL'nQKRC 

TKHAYORKPIWLSQMACSGPEPTLHDCPFRP 

LOEDTUFHVEYTSVHGRHILSAKD 


1166 


2516 


A 


9363 


201 


387 


PPlLRWrPPSGRNFFFEFFFESEFlf/SSPRVECS 
GAISAHLAHCM.CLPGSSDSPASAFQVAS 


1167 


2517 


A 


9368 


707 


1087 


AVLTPCLSPCSPSRIPRPVSRPYPGRRSLSHrPP 

PRPLILYAPAPVRPAGTAFIPHSHPPPPDLLRPT 
AXPA/TPCPSIPPPPRPLHPTQPSTALLPDPPPW 
PLPFPPPSS/RPPRPDCSTSYSPTFPPPT 


1168 


2518 


A 


9375 


511 


15


MMLSEETSAVRPQKQTRFNGAKLVWMLKGS 

PITVTSAVnVUvlLIAlM/IFSPWLATHDPNAID 

LTARLLPPSAAHWFGTDBVGRDLFSRVLVGS 

QQSDLAGLVWATTGMIOSPLECLFGELGGRA 

DAIFMRVMDIMRS/IPSLVLTMEK.TAALGPSL 

FNAMQASSEH 


1169 


2519 


A 


9377 


42 


410 


C^JGRVAPRDPGAVASAliPGL'lTHDSGVNPN 
NSARRMEAMASGSNWLSGVNWLVMAYWS 
LVFVLLFIFAKRQIMRFAMKSLROPHGPVOH 
NAPKDLKEEIDILLSRVHNIKYEPVHLLADDDA 


1170 


2520 


A 


9378 


302 


1303 


GVSGFSASVLRQRRMEDELEPSLKJPRTQIQGR 

nJXnCAAGIGGTFQFGYNLSnNAPTLHIQEF 

TNETWOARTGEPLPDHLVLLMWSI IV<!1 YPT 

GGLFGALLAGPLAITLGRKKSLLWNNIFWS 

AAILFGFSRKAGSFEMIMLGRLASWGVNAGV 

SMNIQP\MLPGGESAPKELRGAVAMSSAIFTA 

LGIVMGO WGLSTTAATnLRni AAORT .F.RT .FR 

ERAACQGCRARRPWELFQHRALRRQVTSLV 

VLGSAMELCGNDSVYAYASSVFRKAGVPEA 

KIQYAnOTGSCELLTAWSVSLEGALPPPAL 

WGGTPRSFALNQFTLQKKKX 


1171 


2521 


A 


9381 


2 


412 


RGPASAQEDERARTAPLERVRARGRMTTSSA 
LFPSLLPCSWSTSNKYLAEFRAGKMSLKGTTE 
TPDKRKGLAY/IQQTDDSLIHFCWKDRTSGNV 
EDDLIIFPDDCEFKRLPQCPNGRVYVLKFKAG 
SKRLFFWMQEP 


1172 


2522 


A 


9384 


20 


335 


OWNGRSTEASPAAEAPHVPHKET\KAAMGTQ 
CTHGGKVRPDPHDMLTTVVHKIKLFVLCHSL 
LQLCAaflSDYLKSSIYTVEKRLGLFRPTSGLL 
ASFNEVGNTALIVLESY 


1173 


2523 


A 


9393 


430 


87 


UX}CIVPGQQKETFSLNPSSATVKFYL*LSLQ 
QRKEDQ«IIL*YHLNKDCLHIFMSAnT-YMKr» 
KIFVLFDFNIMFETPFY1I*FIFLFSQNLKRIRQV 
IRPPISFSKINNGP 


1174 


2524 


A 


9397 


77 


374 


ERLEIGRLGGERGSGPASCLRVIDVS<avIWDQ 
RLVKLALLQLLRAFYGIKVKGVRVHRDCGTF 
ESSSTLIRVS*FGVPCNALAHFGVTHF*YILDF 
LGML 


1175 


2525 


A 


9399 


£6 


397 


HESSRADRDKMDTRGSTYTDADPVNKSGGT 
AKMNKWSKGKVRDKLNNLVLFDTATYDKL 
CKEVPNYKLITLAVVSERLKIPGSLARAALfffi 
LLSROU'LVIQHIAQVIY 


1176 


2526 


A 


9408 


2 


299 


LDLTHV1^LS1SLTVTILGTTFC»1VIPLLDVVY 
GERGYAQNGDF*DAQLDDYSFSCYSHAQVN 
GAPNSLTOAYDDP»VKISGLECX}KVGALVEV 
KCLNL 


1177 


2527 


A 


9416 


2 


402 


CNFLRSSRIRVHSTPAASTMPPKVDPNBIKYV 
YLRCTGGEVRATSALAPKIGPLGLSSIKVGVD 
FV»ATGDWNVLnSVILTIRILLSHIFVVPPFFCF 
DHLIAFWDLQSLIFLHVIFSLFITLLLFCFFSIF 


1178 


2528 


A 


9419 


142 


426 


TTLFDL'WPRWLSWLETVLTSLRTRRAASGPP 

ACRIMPTTVDDVLEHOGEVHFLQKQMLYLL 

ALI*DTFAPIYVGIVFLGnTOHRCRSPGVAEL 


1179 


2529 


A 


9420 


1450 


1655 


LSSAGTKMNLN«KNYWPGASAHACNPSTLG 

GQSRCrrRSGDRDHPG*HGETPSVLKIQKISRA 

WWRAP 


1180


2530


A


QA09 


1 /O 


TIC 


HRPQriKl'DWKPRT*PQGK*QRLSSEISPASPF 
SRFSRSTKPVPPKADFPARQKLTGVLHAPIXK 
L 


1181 


2531 


A 


9436 


2 


274 


PIAASLRMyMLQPYlliliNUCTAFATMVETVP 
IARTILDRLTGIPHGYCFVE*ADWATADKCVH 
lYNQKPLPOATPLLSLQLHQLAHLGS 


1182 


2532 


A 


9442 


3 


240 


VDKCSSKSWLSEYCPHCMCSLSTDPKPFGQL 
SMILK*MGAGDEKISAMGKARVDHRELYLGL 
LYPTEDYKLTFRARH 


1183 


2533 


A 


9444 


384 


3 


LlOJFQPWALHDWPIJFCCXriTLLFLVLECFrR 
KGCSGWAPWLSLQCQHFGRPRWADHLRSGV 
RDQPGQYSKTTFLPKIQIOAGHSGAHL* S*LL 
ERMRWKNRLNPGGRSCSEPRWHHCTPGWAT 
ERG 


1184


2534


A


9462 


391 


655 


I^GFKSLNIPKIPLQYIYVRVRTTWSFCLPLDG 
RKLAlLS*YSK*LT*KYNILPEySRMTLPPOMV 
IHTCNPSTLGGRAGWIV*AQEFET 


1185


2535 


A 


9467 


215 


566 


RCPMWQGQASRMDPAKAKDREASTCCSLA 
WWWGWBCWVRALKLSSGPAGPLACWVAK 
KKSLSLSGPVYPSEKGAGLYVF*DRVSLCHPG 
WSAWQFWLTAASNSCFSLLSSWDYRCA 


1186 


2536


A 


9468 


273 


432 


HIPQLHTKTHYVPTRMVNKI*QIDNSKPWQR 
GG*TGrLTHCW»ESICLVQPLWKIVWHYQ 


1187 


2537 


A 


9469 


388 


3 


EVAPGPSQILPRRVTDGGDRPQFSLPGPRLPQ 
SSRGAEPCLSNCMSPAPRKQRMGDSDQ'STP 
NPASPHPEAPQEPWDSASGSVGSFSLOROAK 
ASS*VPGKGRGPRQGSELLAETILELFLALAN 
S 


1188 


2538 


A 


9471 


124 


397 


TMDKKNRHGNSLDMASEIHMTGPMCLIENTT 
GRLMANPEALKILSAn'QPMVEEAIAGLYRAC 
♦FYLTNNLAGMKKGLCLGSTEQAHnGI 


1189 


2539 


A 


9480 


584 


769 


GHVQSQHFGRPRRADHLRSGDRDHPG*HDET 

PSLLKIQKISWAWWRAPWPATWEAEAEEW 

R 


1190


2540


A




463 


86 


VTVGLTLLLRGAPRFTAG*PPSGGGPPLAPLL 
PRQHCni,QTHRHLHPEAPVKV*KT*RLFPGLR 
GASSCatRRRCNPVLAARKAGSPRSHSTRENC 
RRSRCPDTAHRRItRRGRRRNPSCVRSPRWR 


1191 


2541 


A 


9489 


1 


411 


LADALCLSAAATGAVRPGARAQPSTRRRLSP 
oVKVCCKAAAAoNLLYobCLQRHSERASEEG 
ERGSLSAKCCSLVLRGGCSSSNSHSFRRrr*EI 
MAAFVLLSYEQRPLKRPRLOPPDVYFPDFKQ 
KEEBLTAVNVK 


1192 


2542 


A 




^RQ 


1 A1 
lOl 


vorL^M&>o(jxlCiKo IKljoJsMVo WaVlAKJQEI* 

CHEDERKMAREFLAEFMSTYVMMNIHMIVE 

KDTYSDHEEINTS 


1193 


2543 


A 


9509 


186 


1 


IAKSQ*KRWQRSGAMETLKHGWWECKLVQF 
FGIC1TVNVN*S*TYVYPCDKIILLLGLYPTEM 


1194 


2544 


A 


9512 


58 


433 


PLQRSKCL1LRCLRAKPWAWSQSPRACSSAL 
LKSSRSRASSLNVQCILQSNPQGHQRI'KQKA 
SSKGOOFRR*KPHPFMI KTI MKT HTFfST*! Iff 

RRAIYDNPTANHVEGQKLEAFPLRTOTRQ 


1195


2545 


A 


9515 


595 


1223 


GHGAPSFQTQVPRTP»ASWPVVPAASESAPAP 

AGGGASLPVAAGSCAAAPHrePOAPQHLLDC 

PCTLCLARPPRRPLPDTCYGPGSGRSASLAEPP 

LPRCSCAPLRSASAPQVS*CV*AVNLLPHNL* 

PIJiLIXHD*EKAWQFLFSSASHCFQGQICLLP 

APGSGPCGATARPSRGGRAGGSRARRPIPPGP 

GTRRTPSGCQNPAASGG 
SEQ ID NO: of nucleotide sequence

SEQ ID NO: of peptide sequence

Method

SEQ ID NO: in USSN 09/496,914

Predicted beginning nucleotide location corresponding to first amino acid residue of peptide sequence

Predicted end nucleotide location corresponding to last amino acid residue of peptide sequence

Amino acid sequence (A=Alanine, C=Cysteine,
D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine,
I=Isoleucine, K=Lysine, L=Leucine,
M=Methionine, N=Asparagine, P=Proline,
Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine, W=Tryptophan,
Y=Tyrosine, X=Unknown, *=Stop codon,
/=possible nucleotide deletion, \=possible
nucleotide insertion)
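The symbol legend in the column heading can be read as a lookup table. The following is a minimal sketch, not part of the original disclosure, of how one entry of the amino acid sequence column might be tallied against that legend. The names `LEGEND` and `summarize` are hypothetical, and the sample string in the usage note is made up for illustration and is not taken from the table.

```python
# One-letter symbols as listed in the table's column heading.
LEGEND = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid",
    "E": "Glutamic Acid", "F": "Phenylalanine", "G": "Glycine",
    "H": "Histidine", "I": "Isoleucine", "K": "Lysine", "L": "Leucine",
    "M": "Methionine", "N": "Asparagine", "P": "Proline",
    "Q": "Glutamine", "R": "Arginine", "S": "Serine", "T": "Threonine",
    "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
    "X": "Unknown", "*": "Stop codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def summarize(entry: str) -> dict:
    """Hypothetical helper: tally the legend symbols in one table entry."""
    # Entries wrap across several lines, so drop all whitespace first.
    cleaned = "".join(entry.split())
    return {
        "length": len(cleaned),
        "stops": cleaned.count("*"),
        "unknown": cleaned.count("X"),
        # Characters outside the legend (for example, transcription damage).
        "unrecognized": sorted({ch for ch in cleaned if ch not in LEGEND}),
        # Stretches between stop codons, with empty stretches dropped.
        "segments": [seg for seg in cleaned.split("*") if seg],
    }


if __name__ == "__main__":
    # Made-up example string, not an entry from the table.
    print(summarize("MKT*A XL\n"))
```

Reporting unrecognized characters separately, rather than discarding them, is a design choice of this sketch: it keeps symbols that fall outside the legend visible instead of silently altering an entry.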


1196 


2546 


A 


9518 


229 


468 


AERVAPGWDLHrPYLPRTNSRRTPHL**EPHA 
GYIGALFPVfSfKiWPOfiO 


1197 


2547 


A 


9521 


289 


448 


lAWLSGLFFPSNQANLCFLCYKLTADSRYRG 
APNWKYKYGY*IPVDMLC 


1198 


2548 


A 


9524 


204 


1 


KNKK'nKCLSIVTL>aSGPNQ*NKRHRVAEWI 
VKQEPNICHL*ETHFPFRDTYRLKEREQKKRK 
SSYS 


1199 


2549 


A 


9546 


1785 


1943 


GGRFKESKLTNAGWQRNSFFIGPPKSIPWAA 
V*QRGDGKNPGVTHLNRPVOTX 


1200 


2550 


A


9548 




1 


VNAJcJSJil"JUQHirMTKJSQNKLnIEHTYLKPI 
JCArroKWTSDIMU«,QKL*AFFLRVIVRQI 


1201 


2551 


A 


9549 


591 


2 


SSWEFPRGPRSSLPPLDb'lVPCGSSPNWTGGC 
GSCPSGE*LVSPGSEQRKKYSNSNVIMHETSQ 
YHVQHLATFIMDKSEAITSVDDAIRKLVQLSS 
KEKIWTQEMLLQVNDQSLRLLDIESQEELEDF 
PLPTVQRSQTVLNQLRYPSVLLLVCQDSEQSK 
PDVHFFHCDEVEAELVHEYMESALTOCRLGK 


1202 


2552 


A 


9552 


428 


1 


KYGNEGHWSRQCPNPGKPKPCPLCRGPHWK 
LDCERPPQGPLPSLPELAKTSYSDLTGLATBD 
•WGPGMDAPATTIASSKTRVTUvIVAGRPVFF 
Ur YRAl X SALJT^FSGITQSSQ VSVVGIDGQV 
SKPRATPPLFCSUTTF 


1203 


2553 


A 


9568 


517 


738 


RRKFERKQKQ*RyREGKQYRQRDKMKEWG 
EKBKRRREKGEREERKMRHRERKGESGQRD 
1 MUN WKVERL IkKER 


1204 


2554 


A 


9573 


83 


415 


EDKRLRLVDGDSRCAGRV»IYHDGFWGnCD 
DGWDLSDAHWCXJKLGCGVAFNATVSAHFG 
EGSGPIWLDDLNCTGTESHLWQCPSRGWGQ 
HDCRHKH3AGVICSEFTALR 


1205 


2555 




9577 






AKuaCJf 1 KrKl ANOKMuliTKDArQML V 1 J"K 
DVAVTFFREEWRQLVLVHRTLYR*GMLETC 
GUJJTLRHNVPQPDVVHLLYHGTQLUVKRE 
VSHSPCAGDMRELFTREATLTPHPYNNGA 


1206 


2556 


A 


9584 


38 


476 


TLGAVLFSEVSKESSTSHSGGQLGRQNRHPKL 
NPQPLSTPSWQIETKYSTKYLTGNWlvflEERRK 
YNPGLPPLRTWNGQKLLWL 


1207 


2557 


A 


9586 


2 


412 


UlSSPAALUlALCnTVTGTALALRSRVATm 
PIXKa«mi<rKnryRLCDKAESWOIALETVPT 
GVAVTSWAIMLTVLTLVCICOQDYNRRQKLP 
THIT rn .1 .♦EK'frlFGI .TRAFTIfJT rv^gmurui.'U'i 

FGILFSICFS 


1208 


2558 


A 


9597 


122 


3 


IKNYWPGMVAHACNPSPLGGRGRWIA*AQK 
FADAWADAW 


1209 


2559 


A 


9611 


148 


528 


KSUtNVWDLLNNTWKADRFFCHSSRTSTKK 

OL/TAJJri roKiWol w i ouKI 2>od XJKluJclvKNI xU 

RIRDHDLLDKRKTVTALKAGHDRAILLGLAM 
MVCSIMM»FLLGITLLRSYMQSVWTRESQCT 
LLNASITETFNC 


1210 


2560 


A 


9618 


384 


2 


SUIDMLNILAEQQQKQICWAVNTQNTAWSNA 

DSKFGQRILEKMEWSKGRGUjVQEQGGPDDI 

KVQVKNNDLGLQATINNEANWIAHQDDFNW 

LLAELNrCQRQETADS**»WSPKNSHVOKDS 

GELSAK 


1211 


2561 


A 


9620 


316 


610 


QKHPOOGQLORSPQBDSRFHNKASSGVSRVR 
LGRAWWLl VVIKILWKAKAGGSPE*!)* AGRG " 
GSRL* SQHFGRPRRVDHLRSAVQDQPGQHGE 
TPSLLKIOKIN*VWfTRRI ♦S"IV<!RAFAf5F<!7 


1212 


2562


A 


9623 


297 


344 


QFPVDGDYQKIEfaTQLFQAQNLSIXLAMrR 
TREL*KGGGKGRHE*AVVI'FI KKfifiYflVirAP 

AIUfrSNCT*GF*ETKMLSDDPKACVFEVSSA 
DL*NTSFOVIR 


1213 


2563 


A 


9624 


2 


356 


TSGTELSAISCLWDLPTPVLRVGLSCQPSMSSQ 
IPR\fY3TDVEAAVNSLEDLYLQAYYAYLCVG 
LYFHRDDMALEGVSRFL'ELAE 


1214 


2564 


A 


9634 


776 


912 


SI.SHWVllAlfl *VPVMnPMf*i >jPT?r;rinr*cpi>D 
SHYCTPAWATHKDS 


1215 


2565 


A 


9636 


220 


426 


KPGNFAVSSEY*DITSGQLKTAVRG*IENrrST 
EENFGEKLHDIGFGNGnx»KT*KAQATKAKI 
DK 


1216 


2566 


A 


9637


391 


76 


RERRSKFKMKNHSPDDTKENVTISNIRTRKI 
NHLPETERNLLEHGLMYIRLNAAFCSLVAHS 
LFGFIUCAT 


1217


2567


A 


9655 


2008 


2432 


LHCKMGALETQTHPCSQNMLRSLQKCXICKV 
tvHttrUA^r V y V UKll Lf LJnoA J AO IXjC-KRPAKrF 
PAPPTPTPWRSRQSGKQSERAS*LKGRGRYGL 
GALGGRGGRALGGSRWPPPLPGETLFSGCKH 
RRRRRGSDAAPGEEAGT 


1218


2568 


A 


9658 


3 


405 


VLMSAQEPWAIKEEHVnQAEFYLNPDQSGEF 
MLDEEGEDTFHGDMAKKETVWRIJE*LARLD 
NFEAQRALANIAADQAALEIMDMOSDyTLIF 
NVPPKVTVL 


1219 


2569 


A 


9662 


3 


284 


PDWIHKJRKMQDTGSILPLHWFGFGYAALVA 

YfirrTTfSWlir AnoVPCT a a/^t t B/^ct cr*i r* A vr\ 
lovniuz Vftjvvjo vi'DJ-AAoJLAjr^joJ.fJwjJjOAY 

LSQDPRNVWVFLATSGTLAGIMGMRFYHSG 
KL 


1220 


2570 


A 


9669 


200 


699 


LLLTGYIQTLQNQQLSGNQQEMQAVDNLTSA 
PGm^LCTRDYKnQVUTIXYTVLFFVGLITN 

frl AKyTRTPPOTPdrOMPlTPT VXTFV/ICT^T T hXTT TC 
\JA>»rUVU\JJ*P\^lI\orkOiNriiI*J.<NIN 1 V l^tJl..I-.ivlll . 1 r* 

PFKILSDAKLGTGPLRTFVCQVTSVIFYFTMYI 
SISn.GUTroRYQKTTRPFKTSNPKNLLGAKIL 
K 


1221 


2571 


A 


9676 


164 


562 


KERDSSri''SAA\frTMOGMEnAMPOAr}P<TVP 
QLGNMAVmSHLWKGLQEKFLKGEPKVLGV 
VQILTALMSLSMOrrVIMCMASNTYGSNHSV 
YIGYTIWGSVMFnSGSLSIAAGIRTTKGr VRfi 
SLGMNTTSS 


1222 


2572 


A 


9688 


43 


412 


VAKMVKCCSAIGCASRCLPNSKLKGLTFHVF 
FTOENIKRKWVLAMKRLDVNAAGrWEPKKG 
DVLCSRHFKKTDFDRSAPNDCLKPGVIPSIFDS 
PYHLQGKREKLHCRKNFTLKTVPATNYNH 


1223 


2573 


A 


9696 


308 


564 


RTSMGILYSEPICQAAYQNDFOQVWRWVKE 

DSSYANVQDGFNGDTPLICACRRGHVRIVSFL 

LKKECLCQPQKPERENUALCCE 


1224 


2574 


A 


9700 


3 


632 


DAWASGGELGSLFDHHVQRAVCDTRAKYRE 

GRRPRAVKVYTINLESQYLUQGVPAVGVMK 

ELVERFALYGAIBQYNAUJEYPAEDFTEVYLI 

KFMNLQSARTAKRKMDEQSFFGGLLHVCYA 

PEFETVEETRKiaxjMRlCAYVVKTTENKDHY 

VTKKKLVTEHKI3TEDFRQDFHSEMSGFCKA 

ALNTSAGNSNPYLPYSCELPLCYFSSK 


1225


2575 


A 


9710 


1 


163 


RSGCVLRMTEWETGAPAVAETPDIKLFGKWS 
TDDVHINDISLQDYIAGVRLILL 


1226 


2576 


A 


9713 


82 


492 


QGLPSFLPAFOPSGSWLGPAPTLGSSCNTVDT 

ICHGYSEERPLFYLSFCDLIXGLCWLTETLLYO 

ASVANKDHCYNLQAVGQIFyiSSFLYTVNYI 

WYLYTELRMKHTQSGQSTSPLVIDYTCRVCQ 

MAFVFSSLI 


1227 


2577 


A 


9720 


3 


416 


GKWKJRTQVPLLGEECADMDLARKEFLRGNG 
LAAGKMNISlDLDTNYAELVLNVGRVmJEN 
NRKKMKDCQLRKQJQNENVSRAVCALLNSGG 
GVntAEVENKGYSYKKDGIGLDLENSFSNML 
PFVPNFLDFMQNGNYF 


1228 


2578 


A 


9723 


278 


411 


EASSSNTVASNVADKTDPHSMNSRVFIONLN 
TLVLQKSDVEAVF 


1229 


2579


A 


9725 


121 


902 


GGSGGPYSKQYAGYDYSQQGRFVPPDMMQP 

QQPYTGQIYQPTQAYTPASPQPFYGNNFEDEP 

PLLEELGINFDHrWQKTLTVLHPLKVADGSIM 

NETDLAGPMVFCLAFGATLLLAGKIQFGYVY 

GISAIGCLGMFCLLNLMSMTGVSFGCVASVL 

GYCLLPMILLSSFAVlFSLQGMVGIILTAGnG 

WCSFSASKIFISALAMEGQQLLVAYPCALLYG 

VFALISVF 


1230 


2580 


A 


9739 


11 


247 


TFVLNMNTPKEEFQDWPIVRIAAHLPDLIVYG 
HFSPERPFMDYFDGVLMFVDISGKCKRDVCL 
MWMSNELAWEFTCRA 


1231 


2581 


A 


9744 


37 


1100 


TPLFDFWPGFVLSWLQPLSASLRARRAASGPP 

ACRIMPTrVDDVLEHGGEFHFFQKQMFFLLA 

LLSATFAPIYVOrVFLOFTPDHRCRSPOVAELS 

LRCGWSPAEELNYTVPQPQPAQEASPROTRR 

YEVDWNQSTFDCVDPLASLDTNRSRLPLGPC 

RDGWVYETTGSSIVTEFNLVCANSWMLOLFQ 

SSVNVGFnOSMSIGYIADRFGRKLCLLTTVLI 

NAAAGVLMAISPTYTWMLIFRLIQGLVSKAG 

WUGYILITEFVQRRYRRTVGIFYQVAYTVGL 

LVLAGVAYALPHWRWLQFTVALPNFFFIJLY 

YWaPESFRWLISQNKNAEAMRIIKHIAiaKNG 

KSLPASL 


1232 


2582 


A 


9753 


164 


517 


PGPGMQGPPPTTPTSWSLPPWRAYVA/VAVLC 
YINLLNYMNWFOAGVLLDIQEVFQISDNHAG 
LLQTTVFVSCLLLSAPVFGYLaDRHSRKATMS 
FGILLWSGAGLSSSnSPRYSWLF 


1233 


2583 


A 


9757 


25 


419 


LPAPWTERVRKSEGLVGTCLGDPMASPRTVT 
IYALSVALGLFFVFMGHKLTPKLSKDAYSEM 
KRAYKSYVRAIJLLKKMGINSILLRKSIGALE 
VACOrVNlTLVPGRPKDVANFFLLLLVLAVLF 
FHQLV 


1234 


2584 


A 


9765 


71 


456 


RLELDWGFSLHFLPVAYIXPLSSGFEMNVQP 

CSRCGYGVYPAEKISCIDQIWHKACaUCBVC 

KNI\fl^VNNFVSHQKKPYCHAHNPKNNTFrS 

VYHTPI^JLNVRTFPEAISOIHDQEDGEQCKSV 

FHWD 


1235


2585 


A 


9767 


52 


559 


IRSGAMSVDKAELCGSLLTWLQTFHVPSPCA 

SPQDLSSGLAVAYVLNQIDPSWFNEAWLQGI 

SEDPGPNWKLKVTSGLLIRGQTGEEMrRDGP 

ARHMSWVMGRKRDRCLVINHLFIHSSMEYSP 

CARPGHSARNNTDKNmrrAIILVrSNrmi 

KINFQAGRSGSCX 


1236


2586 


A 


9770 


352 


608 


FRGEALTVRFLTXRFIGEYASltaESIYKKHLC 
LERKQLNLEIYDPCSQTQKAKFSLTSELHWA 
DGFVIVYDISDRSSFAF/UtALI 


1237 


2587 


A 


9793 


266 


515 


NILAHYFPFPlOJFLLlU^SQSNPEj^FALTLCHH 
QKEKNFQILPVSIDALTPPLVVCFLVSFLTHFS 
RYKPTRPVOTQFQGCS 


1238 


2588


A 


9802 


537 


967 


aXJAGRSDREAMEAAVKEEISVEDEAVDKNI 

TDEDSTRFQENEASKVPLLAEIYOIEGNIFRIJC 

INEETPLKPRFEVPDVLTSKPSTVRLISCSGDT 

GSLILADGKGDLKC 


1239 


2589 


A 


9805 


105 


540 


MNKRQLYYQVLNFAMIVSSALMIWKOLTVLT 

GSESPIVWLSGSMEPAFHRGDLLFLTNFRED 

P1RAGEIVVFKVEGRDIPIVHRVIKVHEKI3NG 


1240 


2590 


A 


9819 


3 


305 


TDGRDPLPCAARRRGGGGECCGAGWVAEWS 
PQPLDPAMLLWMQGFVLEAVACQDNDDYLR 
YODLFEDLDCNGDGWDIIELQEGLRNWSSAF 
DPNSEEHG 


1241 


2591 


A 


9834 


841 


1209 


SPARGKSNRTDVMrrAPKNKKMTENLAAPEA 
U>SSTHSSSTATQSRAKMNTPAPTPSTVPAIPR 
UOoUOl'rjrUArlllJK V oo VLl^ClJl VAM 
SSHSWEFLFKRTKTPSPFHPAVRENRN 


1242 


2592 


A 


9843 


3 


589 


TISCGPATEPPASLLSSASSDDFCKEKTEDRYS 

DGSVKCTHQPCLIKWISERGCWSCELCYYKY 
HVIAISTKNPLQWQAISLTVIEKVQVAAAILGS 
LFLIASISWLIWSTFSPSARWQRQDLLFQICYG 
MYGFMDVMIVAVDSEDMVQAAKEVGKRWS 
DIPP 


1243 


2593 


A 


9846 


198 


411 


DGTHSNFILANAQVAKGFPIVYCSDGFCELAG 
FARTEVMQ 


1244 


2594 


A 


9848 


116 


650 


PICGFLYLCSAMASESSPLLAYRLLGEEGVAL 

PANGAGGPGGASARKLSTFLGVWPTVLSMF 

SIWFLRIGFWGHAGLLQALAMLLVAYFILA 

LTVLSVCAIATNGAVQGGGAYCILQHRWTG 

VWPVLPAREVMISRTLGreVGGSIGLMFYLA 

NVCGCAVSLLGLVESVLDVFGA 


1245


2595 


A 


9849 


573 


1620 


KSKCRFPEOLSEOFGPMRKEALSSGSVQEAE 
AMLX)EPQEQAEGSLTVYV1SEHSSLLPQDMM 
SYIGPKRTAWRGIMHREAFNIIGRRIVQVAQ 
AMSLTEnVLAAAt ADHI PFTJKWSAFITB'RPI 

KSSLGYEITFSLLNPDPKSHDVYWDIEGAVRR 

YVQPFLNALGAAGNFSVDSQILYYAMLGVNP 

RFDSASSSYYLDMHSLPHVINPVESRLGSSAA 

SLYPVLNFLLWPELAHSPLYIODKDfrAPVAT 

NAFHSPRWGGIMVYNVDSKTYNASVLPVRV 

EVDMVRVMEVFLAQLRLLFGIAQPQLPPKCL 

LSGPTSEGLMTWELDRLLWARSVENLATATT 

TLTSLA 


1246 


2596 


A 


9850 


114 


464 


PPQLGAQRVREPRHPDVRAPLRVTSPGLRSRS 
ARSLGRRPRIAMVTVGNYCEAEGPVGPAWM 
QIXjI5PCFFFn.VPSTRMAIXJTLALVLALPCK 
RRERPAGADSLSWGAGPRISSYV 


1247 


2597 


A 


9851 


2 


327 


FVRNKKMTRSCSAVGCSTRDTVLSRERGLSF 
HQFPTDTIQRSKWIRAVNRVDPRSKiawiPGP 
GAILCSKHFQESDFESYGIRRKLKKGAVPSVS 
LYKVFKYSSRCTS 


1248 


2598 


A 


9853 


58 


444 


RVDDFWSKGGKDAGGADVSLACRRQSIPEE 

FRGITWELIKKEGSTXGLTISGGIDKDGKPR 

VSNLRPGGLAARSDLLNIGDYIRSVNGIHLTR 

LRHDEIITLLKNVOERWLEVBYELPPPGGCP 

WT 


1249 


2599 


A 


9856 


2 


1265 


LPPPRPSRHKRGRAGTRASAAAAAGPTVSAV 

RAPVRGQDSGAGTPQGRLAGRGAHLSRVGA 

SGSGVAAGPAARHAPRRRCADAGEAVOASC 

GRCAVALLSOVCTLVSTHVCVGSQCPGAAGT 

PMGAGDAGASAESAVTTAPQEPPARPLQAGS 

GAGPAPGRAMRSnXLALLALVLLYLVSGAL 

VFRALEQPHEQQAQRELGEVREKFLRAHPCV 

SDQELGLLIKEVADALGGGADPETNSTSNSSH 

SAWDLGSAFFFSGTDTTIGGGQDWHVGGGK 

ELPHGGRCRErEGSQVAPRLPASPUa>OYGN 

VALRTDAGRLFCIFYALVQIPLFGILLAGVGD 

RLXJSSLRHGIGHIEAIFLKWHVPPELVRVLSA 

MLFUJGCXLFVLTPTFVTCYMEDWSKLEAIY 

FVIVTLTTVGFGDYVA 


1250


2600 


A 


9873 


2 


652 


FWPSPCGOIPGRAPNQASRPTMGNSASRNDF 

EWVYTDQPHTQRRKEILAKYPAIKALMRPDP 

RLKWAVLVLVLVQMLACWLVRGLAWRWLL 

FWAYAFGGCVNHSLTLAIHDISHNAAFGTGR 

AARNRWLAVFANLPEGVPYAASFKKYHVDH 

HRYUKJIXJLDVDVPTRLEGWFFCTPARKLL 

WLVIXJPFFYSIJU'LCVHPKAVTOMEVLKrLV 

QLA 


1251


2601 


A 


9875 


150


1209 


PVIMPLHFSPGDIVRPSCCVSSSPKI-RRNAHSR 

LESYRPDTDLSREDTGCNLQHISDRENIDDLN 

MEFNPSDHPRASTIFLSKSQTDVBEKRKSLFIN 

HHPPGQIARKYSSCSTTFLDDSTVSQPNLKYTI 

KCVALAIYYHIKNRDPDGRMLLDIFDENLHPL 

SKSEVPPDYDKHNPEQKQIYRFVRTLFSAAQL 

TAECATVTLVYLERLLTYAEIDICPANWKRIV 

LGAILLASKVWDDQAVWNVDYCQILKDrrVE 

DMNELERQFLELLQFNINVPSSVYAKYYFDL 

RSLAEANNLSFPLEPLSRERAHKLEAISRLCED 

KYKDLRRSARKRSASADNLTLPRWSPAIIS 


1252


2602 


A 


9879 


6 


376 


KRPDSRPPAQYRAGPTRPRTRGCELLYWKAT 
KAVGKMGSLSTANVEFCXDVFKELNSNNIG 
DNIFFSSLSLLYALSMVLLGARGETEEQLEKV 
WNSSEVCSEPRSLSCSRSGSAKLILSLYQ 


1253 


2603 


A 


9880 


180 


388 


KEQABLLYQLYCQCDL'IISSHPSSVPAMSSC 

NFTHATFVLIGIPGLEKAHFWVGFPLLSMYVA 

AMFGNC 


1254 


2604 


A 


9881 


19 


494 


VISFQllTDnMDSSTAHSPVFLVFPPElTASEYE 

STELSATIPSTQSPLQKLFARKMKILGTIQILF 

GIMTFSFGVlFLrrLLKPYFRFPFIFLSGYPFWO 

SVLFINSGAFIMVKRKTTETLnLSRIMNFLSA 

LGAIAGDLLTFEFHPRSKLHL 


1255


2605 


A 


9896 


72 


386 


RPGREQRDCFQAPPLGLGGRQTDMMHHPLT 
vu\ I C VvjUrN VIjMC.PQLSGALTFMYLQQGNQ 
EATVAPDTMAQPYASAQFAPPQNOIPGEYTA 
PHPHPAPEYTGQTT 


1256


2606 


A 


9902 


95 


399 


SGGPAGLLHRPVLPKMGLSGLLPILVPFILLG 
DlQEPGHAEGILGKPCPKIKVECEVEElDQCrK 
PRDCPENMKCCPFSRGKKCLDFRKVSL1LYH 
KEELE 


1257


2607 


A 


9903 


374 


459 


niLKSTTNRLGWAHra«>STLGORGGW 


1258 


2608 


A 


9911 


364 


1974 


AGPGVPAVGORWASGPGLGGRTLCSGPPDH 

QRRGPSCGASGDPQCVGSPHPQRARPLLARP 

GARLLPGHLPSPRPPRLPTGQPPAAAFRGPVR 

PQOGOHIHPLPTPGGRPCFAVSEGSGSALLXS 

YLGECGSSSYVTGAAOSPVLRCREWFEAGLP 

WFYERGFLLHQKIALSRYATALHJTVDTSRL 

FRSRSLREFEEALFCarrKSFPISWDAYWDRND 

PLRDVDEAAVPVLCICSADDPVCGPPDHTLTT 

ELFHSNPYFFLLLSRHGOHCGFLRQEPLPAWS 

HEVILESFRALTEFFRTBERIKGLSRHRASFLG 

GRRRGGALQRREVSSSSNLEEIFNWKRSYTRL 

MAAAAGAAAAPGSREPQDRPECGAGHPGPR 

YYRHPERWLLRPEAFLGPUITRAPSAEDSQR 

ERPAARSGPEMRVRYPWAAVLAPYLALSQD 

PMYKSSASGQGASGSYNHVREEMLKAGGA 

MSRRWRQSKFRHVFGQAAKADQAYEDIRV 

SKVTWDSSFCAVNPKFLAnVEAGGGGAFrVL 

PLAK 


1259 


2609 


A 


9919 


693 


935 


GCa=KFIGESTCCWIFPSSVTTQCVVAKAPRAA 
TLSKAERLRSQPGPEQGOSSYRPRTPTAAAIL 
PPRPORSHRKRKLVSTK 


1260 


2610 


A 


9921 


455 


1082 


QRSCLCSAIEKDGGDVKALYRJRSQALEKLGR 

LDQAVLDLQRCVSLEPKNKVFQEALKNIGGQ 

IQEKVRYMSSTDAKVEQMFQILLDPEEKGTE 

KKQKASQNLWLAREDAGAEKIFRSNGVQLL 

QRLLDMGETDLMLAALRTLVGICSEHQSRTV 

ATLSILGTRRWSILGVESQAVSLAACHLLQV 

MFDALKEGVKKGFRGKEGAnV 


1261 


2611 


A 


9928 


1 


438 


GFRGAEAPGAAQAPKXKKPRPTEGGPGAGSG 

RGKDPYRGPTLLHQPKPPKDEFLSSLESYEIAF 

PTOVDHNGALLAFSPPHPQRQRRGTGATAES 

RLFYKEASPSTHFLLNLTRSSRLLAOHVSVEY 

WTREGLAWQRADRPHCLYA 


1262 


2612 


A 


9931 


168 


435 


AAEMGRAGAAAVIPGLALLWAVGLGGPPPA 
PPRLPFCLQELQ<aiHALHrFSLBRTCSYQDFL 
WADEGRLLHVGAQDLATWHTLSPLGLW 


1263 


2613 


A 


9938 


247 


488 


RMSATSVDQRPKGQGNKVSVQNOSIHQKDG 
CNDDDFEPYLRSPDNQSNSYPPMSDPYMPGY 
YAPSIGFFYSLGEAAWSQL 


1264 


2614 


A 


9941 


61 


277 


ESIGLTALGPRRRPWEHRWSDPITLKMKGWG 
WLALLLGALLGTAWARRSQDLHCGACKAVR 
RRVRQFNIYDY 


1265 


2615 


A 


9956 


2 


522 


FVASEVSKMPVPASWPHPPGPFUXTLLLGLT 

EVAGEEELQMIQPEKULLVTVOKTATLHCTV 

TSLLPVGPVLWFRGVGPGRELIYNQKEGHFP 

RVrrVSDLTKKNNMDFSnUSSnPADVGTYY 

CVKFRKOSPDHVEEKSOAGTELSVRGEYSVG 

FLSQVWWWLSSHPFMN


1266


2616 


A 


10002 


243 


387 


PKNNACHLLFTAVCQPRCKHGECIGPNKCKC 
HPGYAGKTCNQGRKTV 


1267 


2617 


A 


10004 


36 


707 


LPAPASTWSVARETMASSSVPPATVSAATAO 

PGPOFOFASKTKKKHFVQQKVKVFRAADPLV 

GVFLWGVAHSINELSQVPPPVMLLPDDFKAS 

SK3KVNNHLFHRENLPSHFKFKEYCPQVFRNL 

RDRFOIDDQDYLVSLTRNPPSESEGSDGRFLIS 

YDRTLVKEVSSEDIADMHSNLSNYHQVRPLS 

SPILSLSSLLTYSSAIVSNRCQLORKUGRENP 


1268 


2618 


A 


10005 


2 


209


GEGYELFVPSNGVPAVCHMVGRSPHRAVLSP 
SQDELEHSLGESAAQGAAGWLWVSWENTR 
TKVSL(HA


1269


2619 


A 


10010 


245 


688 


FGMLKNKOHSSKKDNLAVNAVALQDHILHD " 

LQLRNLSVADHSKTQVQKKENKSLKRDTKAI 

IDTGLKKTTQCPKLEDSEKEYVLDPKPPPLTL 

AQKLGLIGPPPPPLSSDEWEKVKQRSLLQGDS 

VQPCyJCKEBFELRPQVFSIRG 


1270 


2620 


A 


10011 


2 


588


RVDDFVRPLPPGLMSRSRASIHRQSIPAMSYA 
PFRDVRGPSTHRTQYVHSPYDRPGWNPRFCai 
SGNQ1XMIJ3EDE1HPUJRDRRSESSKNKLJLR 
RTVSVPVEGRPHGEHEYHLGRSRRKSVPGGK 
QYSMEGAPAAPFRPSQGFLSRRLKSSIKRTKS 
QHCLDRTSSFRQILPRFRSADHDRYRQWSMW 
DEIDV 


1271 


2621 


A 


10013 


209 


363 


LPAPFNLSPRLSFGFQFPG<»JDNYLTrrGPSHP 
FLSGAEVSQSCKRRGGRA 


1272 


2622 


A 


10014 


7 


388 


SAVnSWKWRSVMGIQTSPALLASLGAGLVT 
LLGLAVGSYLVRRSRRPQVTLLDPNEKDLLR 
LnJKTLSARSPCKHIYLSTRIDGSLSIRPYrPVT 
SDEDQGYVDIDIKVYLKGVHPTFPEGGKMSH 


1273 


2623 


A 


10016 


1 


1339 


MAARTLGRGVGRLLGSLRGLSGQPARPPCX3V 

SAPRRAASGPSGSAPAVAAAAAQPGSYPALS 

AQAAREPAAFWGPLARDTLVWDTPYHTVW 

DCDFSTGKIGWFLGGQLNVSVNCLDQHVRKS 

PESVALIWERDEPGTEVRITYRELLETTCRLA 

NTLKRHGVHRGDRVAnfMPVSPLAVAAMLA 

rrFNCJGLRGGRWELKKTVDEAVKHCPTVQH 

VLVAHRTDNKVHMGDLDVPLEQEMAKEDP 

VCAPESMGSH3MLFMLYrSGSTOMPKGrVHT 

QAOYLLYAALTHKLVFDHQPGDDFGCVADIG 

WITGHSYWYGPLCNGATSVLFESTPVYPNA 

GRYWETVERLKINQFYGAPTAVRLLLKYGD 

AWVKKYDRSSLRTLGSVGEPINCEAWEWLH 

RWGDSRCTLVDTWWQT 


1274 


2624 


A 


10017 


1 


3750 


FRPQOTPRSPASHVLTMSAPDEGRRDPPKPKG 

KTLGSFFGSLPGFSSARNLVANAHSSARARPA 

ADPTGAPAAEAAQPQAQVAAHPEQTAPWTE 

KELQPSEKMVSGAKDLVCSKMSRAKDAVSS 

OVASWDVAKGWQGGLDTTRSALTGTXEV 

VSSGVTGAMDMAKGAVQGOLDTSKAVLTG 

TKDTVSTGLTGAVNVAKGTVQAGVDTTKTV 

LTGTKDTVTTOVMGAVNLAKOTVQTGVETS 

KAVLTGTKDAVSTGLTGAVNVARGSIQTGV 

DTSKTVLTGTKDTVCSGVTGAMNVAKGTIQT 

GVDTSKTVLTGTKDTVCSGVTGAMNVAKGT 

IQTGVDTSKTVLTGTKDTVCSGVTGAMNVA 

KOnQTOVDTTXTVLTGTKNTVCSGVTGAVN 

LAKEAIQGGLDTTKSMVMGTKDTMSTGLTG 

AANVAKGAMQTGLNTTQNIATGTKDTVCSG 

VTGAMNIJVRGTIQTGVDTTKn^TOTKDTVC 

SGVTGAANVAKGAVQGGLDTTKSVLTGTKD 

AVSTGLTOAVNVAKGTVQTGVDTnCTVLTO 

TKDTVCSGVTSAVNVAKGAVQGGLDTIXSV 

VIGTKDTMSTGLTGAANVAKGAVQTGVDTA 

KTVLTGTKDTVTTGLVGAVNVAKGTVQTGM 

DTTKTVLTGTKDTIYSGVTSAVNVAKGAVQT 

GLKTTQNIATGTKNTFGSGVTSAVNVAKGAA 

QTGVDTAKTVLTGTXDTVTTGlAiaAVNVAK 

GWQTSVDTTKTVLTGTKDTVCSGVTGAAN 
VAKGAIQGGLDTTKSVLTGTKDAVSTGLTGA 

VKXAKGTVQTOMDTTKTVLTGTKDAVCSGV 

TGAANVAKGAVQMGVDTAKTVLTGTKDTV 

CSGVTGAANVAKGAVQTGLKTTQNIATGTK 

NTLOSGVTGAAKVAKOAVQGGLDTTKSVLT 

OTKDAVSTGLTGAVNLAKGTVQTGVDTSKT 

VLTGTKDTVCSGVTQAVNVAKGTVQTGVDT 

AKTVLSGAKDAVTTGVTGAVNVAKGTVQTG 
VT^ACI?' A\n iuf/^'TvrYn/T?o/^\rT'/^ axmc^m a a 
VlyAoKAVIjMOlruJl VrDuV 1 VJ AMonlAlLlj A 

VQGGLDrnCTVLTGTKDAVSAGLMGSGNVA 

TGATHTGLSTFQNWLPSTPATSWGGLTSSRT 

TDNGGEQTALSPQEAPFSGISTPPDVLSVGPEP 

nY/cJ\J\J\l IK.ULAllJVAlrl'QLjAAFljREIJrG 

LLATTHGPEEAPRLAMLQNELEGLGDIFHPM 

NAEEQAQLAASQPGPKVLSAEQGSYFVRLGD 

LGPSFRQRAFEHAVSHLQHGQFQARDTLAQL 

QDCFRL 


1275 


2625 


A 


10025 


124 


415 


TILARKXEICTCPCKKEIGRNSRSGMYSRKAM 
YKJKJS.Y bAANTK.VEJCKjCKEKVLAPVTKPVGG 
DKNGGTRWKLPTMPRYYPTEDVPRKLLSHG 
KKPFS 


1276 


2626 


A 


10030 


3 


507 


GGSLRFSPPRVPSCSRVFCTVPPGGCGLPSPMS 

ASRPQSPTTPWCLPRRYMKHKRDDGPEKQED 

EAVDVTPVMTCVFVVMCCSMLVLLYYFYDL 

LVYWIGXFCLASATGLYSCLAPCVKIU-PFGK 

C3RIPNNSLPYFHKRPQARN4LLI.ALFCVAVSV 

VWGVFRNEDQ 


1277 


2627 




10035 


51 




laKrX VrLrAl MAiabVARHLLFQSHMATKT 

TCMSSQGSDDEQIKRENIRSLTMSGHVQFESL 

PDQLVNRSIQQGFCFNILCVGETGIGKSTLIDT 

LFNTNFEDYESSHFCPNVKLKAQTYELQESN 

VQLKLTIVNTVGFGDQINKEERQLGRSQSTEN 

•» * i\A3lSV^rLr V CjTftJSS^ I or Vr 1V.LjA1AjIv WAlilh 

SSOQSAQQPYLPINSPPHRLADVADVHLFSSV 
LSGAFGCYHLDVTVNEFKKQQNRDEQEGYS 
KfinnRon^WK"Hf;Ar»PT nrinT7N)i 


1278 


2628 


A 


10036 


3 


457 


rafdvrrkkslrpccprdfhagcltvsgpSt 

vmgavgeslsvqcryeekyktfnkywcrqp 

clpiwhemvetogseowrsdqvntdhpodl 

TFTVTLENLTADDAOKYRfRIAnT OFrifW Srt 
FLPDPFFQVQVLVSSASSTENSVKTP 


1279 


2629 


A 


10039 


214 


435 


NDSLVPMSSWRSCARAPSSESAWRRSAATRR 
SRKCLRTICRKRWSSGKGTQMQSTLSETPRRA 
QMPCMWWYPFWO 


1280 


2630 


A 


10043 


2 


344 


RATWHNAGKEREAVQLMAGAEKRVKASHS 
FUlGLFGGmiUEEACEMYTOAANMFKMAK 
NWSAAGNAFCQAAKLHMQLQSKHDSATSFV 
DAGNAYKKADPQGKTARHVACYLCV 


1281 


2631 


A 


10080 


620 


818 


VIYKLDSSLFSYFIYFFIFETESHFLPLMKWTG 
PIMAHCSLKILASRNSADSAFLSAGEyrSLSHST 


1282 


2632 


A 


10084 


3 


1640 


SASinRGDKRASGEVGlAPSSRHILIGEPSAKY ' 

NGTADSLVRGPGILGEVTVFWRIFPPSVGEFA 

ETSGKLTMRDEQSAVIWIQALNDDIPEEKSF 

YEFQLTAVSEGGVLSESSSTANITWASDSPY 

GRFAFSHEQLRVSEAQRVNTniRSSGDFGHVR 

LWYKTMSGTAEAGLDFVPAAGELLIEAGEM 

RKSLHVElLDDDYPEGPEEFSLTrrKVELQOR 

GYDFTIQENGLQIDQPPEIGNISrVRniMKNDN 

AEGIIEFDPKVTAFEVEEDVGLIMIPWRLHGT 
YGYVTADnSQSSSASPGGVDYlLHGSTVTFQ 

HGQNLSFINISIIDDNESEFEEPIEILLTGATGG 

AVLljKHLVsiullAKiDSrFGVIRFLNQSKISIA 

NPNSTMILSLVUERTGGIXGEIQVNWErVGPN 

SQEALLPQNRDIADPVSGLFYFGEGEGGVRTO 

LTIYPHEEIEVEETFnKLHLVKGEAKLDSRAK 

DVTLTIQEFGDPNGWQFAPETLSKKTYSEPL 

ALEGPLLITFFVRRVKGTFOHM 


1283 


2633 


A 


10088 


316 


516 


MGSKTLPAPVPIHPSLQLTNYSFLQAVNGLPT 
VPSDHLPNLYGFSALHAVHLHQWTLGYPAM 
HLXRS 


1284 


2634 


A 


10091 


2 


569 


FVSPSRAMASAUYVSKFKSFVBLVVmiLLP 

LVnJvIPAKFVRCAYVIILMAIYWCTEVIPLAV 

TSUvlPVIXFPLFQILDSRQVCVQYMKDTNML 

FXX3GLIVAVAVERWNLHKRIAURTLLWVGA 

KPARLMLGFMGVTALLSMWISNTATTAMMV 

PIVEAILQQMEATSAATEAGLELVDKGKAKE 

LP 


1285


2635 


A 


10092 


290 


728 


KQSTRPDVNfTLYPLHWQEEMSGESWSSAVP 
AAATRTTSFKGTSPSSKYVKLNVGGALYYTr 
MQlI.TKQimiLKAMFSGRMEVLTX)SEGWIL 
IDRCGKHFGTILNYLRDGAVPLPESRREIEELL 
AEAKYYLVQGLVEECQAALQV 






A 


lUlUU 


1 


574 


RPRGRGAWAGPGQDYSGVRRQQRRRTRISGS 

QRGSDAAGTMGCCTGRCSLICLCALQLVSAL 

ERQIFDFLGFQWAPILGNFLHnWILGLFGTIQ 

YRPRYIMVYTVWTALWVTWNVFIICFYLEVG 

GLSKIOT>LMTFNISVHRSWWREHGPGCVRR 

VLPPSAHGMMDDYTYVSVTGCIVDFQYLEVI 

HSA 


1287 


2637 


A 


10103 


232 


376 


RSRMGDKPIWEQIGSSFIQHYYQLFDNDRTQL 
GAIYVSFQL 


1288




A 

A 


10 lu/ 


1 
1 


478 


MEEKUESRGKl'iiESGEDRGDGPPDRDPTLSPS 

AFILRAIQQAVGSSLQGDLPNDKDGSRCHGL 

RWRRCRSraSEPRSQESGOTDTATVLDMATD 

SFLAQLVSVLDPPDTWVPSRLDLRPGESEDM 

LELVAEVIUGDRDPIPLPVPSLLPRLRAWRTG 

KT 


1289 




A 


mi 17 




4 Jo 


LJ-itiKMPb i WKAOoIjKDPEIAELFFKEDPEKLFT 

DLREIGHGSFGAAYFARDVRTNEWAIKKMS 

YSO 


1290 


2640 


A 


10114 


367 


556


RGAKAKSAVLPPGPPCSSIULSPPAPLTPRSPG 
lEATRPTAMSKSLKKKSHWTSKVHESVIGRN 

ESGSKLVSEELLLEVNETPVAGLTIRDVLAVI 

KHCKDPLRLKCVKQOESSOLLSVLPOGGTAR 

GAGQ 


1291 


2641 


A 


10116 


128 


591 


RTKBTERRSALSCSVLKSEPLPGLQPQASQQR 

RRRLPGRRQVQVQEGGGSGLRAWVLAMASV 

LGSGRGSGGLSSQLKCKSKRRRRRRSKRKDK 

VSILSTFLAPFKHLSPGITNTEDDDTLSTSSAE 

VKENRNVGNLAARPPPSGDRARGGATR 


1292 


2642 


A 


10121 


1 


749 


QRRRFRAGLWGGHGLTDGLRRNGGCGCSAR 

VPRVGHRLRGHRCPDPLCLLLDMLFLSFHAG 

SWESWCCCCUPADRPWDRGQHWQLEMADT 

RSVHETRFEAAVKVlQSLPKNGSFQPTNHvlM 

LKFYSFYKQATEGPCKXatPGFWDPIGRYKW 

DAWSSLGDMIXEEAMIAYVEEMKKIIETNIP 

MTEKVEELLRVIGPFYEIVEDKKSGRSSDrrSD 



WO 01/57188 PCT/US01/03800



LGNVLTSTPNAKTVNGKAESSDSGAESEEEE 
AC 


1293 


2643 


A 


10124 


2 


989 


PLMSLVRWEFVAASSAQKTPSRLENYYMVC 
KADEKFNQLVHFLKMHKQEKHLVFFRYSSGL 
pnpnre'n9Ai3MPQTr*Ar*i7i?w^vAT irm \rv 

V^VJ'VWU^JL'OARIVll^Ol wA^VJ&I X 0&^\LJi VL> Vfw 

GVKIMCjDHGKKlKYKRNKIFMEFRKLQSGILV 

CTDVMARGIDIFEVNWVLQYDPPSNASAFVH 

RC»RTAIUGHGGSALVFULPMBESYINFLAIN 

QKCPLQEMKPQKNTADLLPKLKSMALADRA 

VFEKGMKAFVSYVQAYAKHECasiLEFRLKDL 

DFASLARGFALLRMPKMPELRGKQFPDFVPV 

DVNTOTIPFKDKIREKQRQKLLEQQRREKTEN 

EGRRKFDCNKAWSKQKAKKK 


1294 


2644 


A 


10129 


91 


1042 


VTMYKDCIESTGDYFLLCDAEGPWGIILESLA 
UL-Url V V 1 II ,1 ,1 .1 1 AJraJrXAlRKJQIXlSQWNVLPTQ 
LLFLLSVLGLFGLAFAFIIELNQQTAPVRYFLF 

mOAIGCSlI-QmATEYVTLIMTRGNJMFVN 

MTPCQLNVDFWLLVYVLFIAiALTFFVSKAT 

FCGPCEhWKQHGRUFITVLFSIIIWVVWISKIL 

LRGNPQFQRQPQWDDPWCIALVTNAWYFL 

LLYIVPELOLYRSCRQECPLQJGNACPVTAYQ 


1295 


2645


A 


10133 


376 


518 


RPRWTHNSQWCa^PQDHPGWLPGQSOAPO 
GROAPRQEGPOSSWRQV 


1296 


2646 


A 


10135 


3 


551 


EWSLDPFMGIMSGQVGDLSPSQEKSLAQFRE 

"KJlOnvi GAT iixiDr\rwm i t> wn a n cT?r\T ^zrry 
yiJ y Lt^ALJrNiriJlJ X r U-a W L.ijA RN 1* 1 J| jljKS 

EDMUUCHMEFRKQQDLANILAWQPPEVVRL 

YNANGICGHDGEGSPVWYHIVGSQDPKGLLL 

SASKQELLRDSFRSCELLLRECELQSQKLGKR 

VRKITAIFriT VniTrl UriT WKT>nTPI T OP 


1297 


2647 


A 


10138 


48 


407 


mvsscx:gsvcsdqgcgqdlcqet(xrpscce 
ttccrttccrpsccvsscxaipqccqsvccxjpt 
csrpsccqttccrttcyrpsccvssccrpqcc 

QPVCCQPTCCRPSCCBrrCCHPXCC 


1298 


2648 


A 


10156


94 


453 


\i^ji^T<xsjjj\Sjmro\^vti\.krn^\3\^X Il^PiuiyiirJcO 
QEMYXJUrDQTTRRSPYRMSRILARHQLVTKI 
QQEEAKEACDWLJRAAGFPQYAQLYEDSQFP 
INIVAVKNDHDFLEKDLGEPLCRRLNT 


1299 


2649 


A 


10161 




393 


PRFSELVDGRGR V<! A R FOn<5PSlf A ATVn 

ASAQLENMEBAPKRVSLALQLPEHGSKDIGN 

VPGNCSENPCQNGGTCVPGADAHSCDCaPGF 

KGRRCELACKVSRPCTRLFSETKAFPVWEGG 

VCHHV 


1300 


2650


A 


10162 


98 


391 


AKIASLERINlPANYTCraPDGDNTDERYFIYA 
VTYTGIL<HmiONILALWVFYaYMKETKRA 
VIFMD4LAIADLLQVLSLPLRIFYYLKHDWPF 
VPV 


1301


2651


A 


10163 


1 


7545 


PGIRVGrrSQTGLSSNLQENCSKLAFISSHGTE 

KQLQCMPMEGRGRASSSISDLQGKGFEKGTG 

EKHVPGVGSARHSPQASAGGSPWQRGKAQT 

RWLGKPDPGRKRRRGSPQEEGGLRVSAAAR 

LLCSGANRCKVLVRQNSTPNTQQPAVHPSTP 

PSRPLPQAGRCLVAPLRPHPDWVAAKrLAKA 

LRAPGKPWRLAAPSPLGDLGAPGLPGPSTAP 

RTLSVEEPGVECNQLCLYADVTDPVLaX3QK 

DPQVEGKHCEKEKISSSKELKHVHAKSEPSKP 

ARRLSESLHVVDENKNESKIEREHKRRTSTPV 

XMEGVQEETDTTRDVKRQVBRSHCrEEPQKQ 
KSTLK2^EKHLKKDDSE1VHLKSLLKKEVKSS 

KEKPEREKTPSEDKLSVKHKYKGDCMHKTG 

DETELHSSEKGLKVEENIQKQSQQTKLSSDDK 

TERKSKHRNERKLSVLGKDGKPVSEYIIKTDE 

NVRKENNKKERRLSAEKTKAEHKSRRSSDSK 

IQKDSLGSKQHGITLQRRSESYSEDKCDMDST 

NMDSNLKPEEVVHKHCRRTKSLLEEKLVLKS 

KSKTQGKQVKWETHXJEGATKQATTPKPD 

KEKNTEENDSEKQRKSKVEDKPFEETGVEPV 

LETASSSAHSTQKDSSHRAKLPLAKEKYKSD 

KDSTSTRLERKLSDGHKSRSLKHSSKDIKKKD 

ENKSDDKDGKEVDSSHEKARGNSSLMEKKL 

SRRLCENRRGSLSQEMAKGEaOAANTLSTP 

SGSSLQRPKKSGDMTLIPEQEPMEIDSEPGVE 

NVFEVSKTQDNRNNNSHQDIDSENMKQKTS 

ATVQKDELRTCTADSKATAPAYKPGRGTGV 

NSNSEKHADHRSTLTKKMHIQSAVSKMNPGE 

KEPIHRGTTEVNIDSETVHRMLLSAPSENDRV 

QKNLKNTAAEEHVAQGDATLEHSTNLDSSPS 

LSSVTVVPLRESYDPDVIPLFDKRTVLEGSTA 

STSPADHSALPNQSLTVRESEVLKTSDSKEGG 

EGFTVDTPAKASrrSKRHIPEAHQATLLDGKQ 

GKVIMPLGSKLTGVIVENENITKEGGLVDMA 

KKENDLNAEPNLKQTKATVENGKKDGIAVD 

HVVGLNTEKYAETVKLKHKRSPGKVKDISID 

VERRNENSEVDTSAGSGSAPSVLHQRNGQTE 

DVATGPRRAEKTSVATSTEOKDKDVTLSPVK 

AGPAriTSSETRQSEVALPCTSIEADEGUIGT 

HSRNNPLHVGAEASECTVFAAAEEGGAWTE 

GFAESETFLTSTKEGESGECAVAESEDRAADL 

LAVHAVKIEANVNSWTCEKDDAVTSAGSEE 

KCDGSLSKDSETVEGTITFISEVESDOAVTSAO 

TEIRAGSISSEEVDGSQGNMMRMQPKKETEG 

TVTCTGAEGRSDNFVICSVTGAGPREERMVT 

GAGWLGDNDAPPGTSASQEGDGSVNDGTE 

GESAVTSTOrrEDGEGPASCTGSEDSSEGFAIS 

SESEENGESAMDSTVAKBGrNVPLVAAGPd) 

DEGIVTSTGAKm)EEGEDWTSTGRGNEIGH 

ASTCTQLOBESEOVLICESAEODSQIGTWEH 

VEAEAGAAIMNAKENNVDSMSGTEKGSKDT 

DICSSAKGrVESSVTSAVSGKDEVTPVPGGCE 

GPMTSAASDQSDSQLEKVEDTTISTGLVGGS 

YDVLVSGEVPECEVAHTSPSEKEDEDirrSVE 

NEECDGLMATTASGDITNQNSLAQGKNQGK 

VLnSTSTTNDYTTQVSAITDVEGGLSDALRTE 

ENMEGTRVTTEEFEAPMPSAVSGDDSQLTAS 

RSEEKDECAMISTSIGEEFELPISSATTIKCAES 

LQPVAAAVEERATGPVUSTADFEGPMPSAPP 

EAESPLASTSKEEKDECALISTSL\EECEASVS 

GVWESENBEIAGTVMEEKDQSGIISTSSVEDC 

EGPVSSAVPQEEODPSVTPAEavlGDTAMISTS 

TSEGCEAVMIGAVLQDEDRLTrrRVEDLSDA 

AIISTSTAECMPISASIDRHBENQLTADNPEGN 

GDLSATEVSKHKVPMPSLIAENNCRCPGPVR 

GGKEPGPVLAVSTEEGHNGPSVHKPSAGQGH 

PSAVCAEKEEKHQKECPEIGPFAGRGQKESTL 

HLINAEEKNVLU^SLQKEDKSPETGTAGGSST 

ASYSAGRGLEGNANSPAHLRGPEQTSGQTAK 

DSSVSSIRYLAAVNTGAIKADDMPPVQGTVA 

EHSFLPAEQQGSEDNLKTSnxaTGQESKIAP

SHTMIPPATYSVALLAPKCEQDLTIKNDYSGK 

WTDQASAEKTODDNSTRKSFPEEGDIMVTVS 

SEENVCDIGNEESPLNVLOOLKLKANLKMEA 

YVPSEEEKNGEH-AFPESLCGGKPSGIAELQRE 

PLLVNESLNVENSGFRTNEEIHSESyNKGBISS 

GRKDNAEAISGHSVEADPKEVEEEERHMPKR 

KRKQHYLSSEDEPDDNPDVLDSRIETAQRQC 

PETEPHATKEENSRDLEELPKTSSEmSTTSRV 

MEEKDEYSSSETTGEKPEQNDDDTIKSQE 


1302 


2652 


A 


10167 


321 


842 


EPSLFPFLRPSPARPPPRPPAPFPSPELAOPEPH 

FVFYFFLSYVHPPKELAKYEYMEEQVILTEKG 

NSTVAGRGTSVRCLSPSPRPLPPLLPLLADLLE 

DGFGEHFFYHCLVAEVPKEHWTTEGNPSRFP 

EARETKCYVRSSVGCVEPLTTQAEVTEba,DR 

KNSQQVFKLLKKK 


1303 


2653 


A 


10171 


206 


429 


NMILLKKRRLLINSLGEGTlNGi.LDIiLLETNV 
LSQEDTEIVKCENVTVIDKARDLLDSVIRKGA 
RACEICITYI 


1304 


2654 


A 


10184 


970 


1524 


LCTLSPGISGTAGSCLTTEPGTELGTSFAQNGF 

YHEAVVLFTQALKLNPQDHRLFGNRSFCHER 

LGQPAWALADAQVALTLRPGWPRGLFRLGK 

ALMGLQRFREAAAVFQETLRGGSQPDAAREL 

RSCLLHLTLQGQRGGICAPPLSPGALQPLPHA 

ELAPSGLPSLRCPRSTALRSPGLSPLLH 


1305


2655 


A 


10194 


2 


394 


TDLLGRRFRVDGAAMAACEGRRSGALGSSQ 

SDFLTPPVGGAPWAVATTVVMYPPPPPPPHR 

DFISVTLSFGESYDNSKSWRRRSCWRKWKQL 

SRLQKNMILFLLAFLLFCGLLFYINLADHWKG 

IRNTCT 


1306 


2656 


A 


10195 


1 


410 


IPGSnSLEGPLSKWTNVMKGWQYRWFVLDY 

NAGLLSYYTSKDKNIMRGSRRGCVRLRaAVl 

GIDDEDDSTFTITVDQKTFHFQARDADEREK 

WIHALEETILRHTLQLQVRVFTWFPDSSLVGA 

FFFWLVSGFFFK 


1307 


2657 


A 


10205 


85 


308 


QGLPSTMVKLOCSFSGKPGKDPODQDGAAM 
DSVPLISPLDISQLQPPLPDQWDCTQTEYQLS 
SPDQQNYTKSR 


1308 


2658 


A 


10214 


2 


453 


ECGGIRQPGPGPPPALASAPAATMNRVGGSPS 
AAANYLLCTNCRKVLRKDKRIRVSQPLTRGP 
SAFIPEKEWQANTVDERTNFLVEEYSTSGRL 
DNTTQVMSIOTQYLESFLRSQFYMLRMDGPL 
PLPYRHYIAIMAAARHQCSYLINM 


1309 


2659 


A 


10233 


45 


421 


RGWPEQQSTGRPRDVARQPRCQKEEGRRUO' 

RALESRITQGSHISRWGPPLESTKENVQCGH 

RPAFPNSSWLPFHERLQVQNGECPWQVSIQM 

SRKHLCOGSILHWWWVLTAAHCFRRTLLDM 

AV 


1310 


2660 


A 


10241 


243 


442 


AFQLFNAKCESAFLSKRNPLQRNWTVLYRRK 
HKKGQSAEIQKKRTRRAFKFQRATTGASLADI 
MAK 


1311 


2661 


A 


10261 


751 


176 


LPQADYGGGHLSLRLFHLLLTSAAWVPDESQ 

VTLNSAICVLSTVLIMEFPDLGKHCSEKTCKQ 

LDFLPVKCDACKQDFCKDHFPYAAHKCPFAF 

QKDVHVPVCPLCNTPIPVKKGQIPDVWGDHI 

DRDOJSHPGKKKEKIFTYRCSKEGCKKKEML 

QMVCAQCH<a«JFClQHRHPLDHSCRHGSRPn 

KAG 


1312 


2662 


A 


10270 


3 


669 


STSSDEGSPSASTPMINKTGFKFSAHKPVIEVP 
SMTBLDKKDGEQAKALFEKVRKFRAHVEDSD
Lrra^YWQTVnCTAKFIFILCYTANFVNAlSF 

Em^aCPKVEHUOYEVFECIHNMAYMLKKL 

LISYISnCVYQn<XYTLFWLFRIPLKEYSFEKV 

REESSFSDBPDVKNDFAFLLHMVXtQYDQLYS 

KW^CVFI^EVSENKIJUEISLNHEWTFEKL 


1313 


2663 


A 


10287 


1221 


266 


GAHRVLSPAQGAQPRLRSAASVEVSMVGQR 

VLIXVAELL^GVlXSEAAKILTISTLXjGSHyLL 

LDRVSQILQEHGHNVTMLHQSGKFLIPDIKEE 

EKSYQVIRWFSPEDHQKRIKKHFDSYIETALD 

GRKESEALVKLMEIFGTQCSYLLSRKDIMDSL 

KNENYDLVFVEAFDFCSFLIAEKLVKPFVAIL 

PTTFGSLDFGLPSPLSYVPVFPSLLTDHMDFW 

GRVKhJFLMFFSFSRSQWDMQSTFDNTDCEHF 

PEGSRPVLSHLLLKAELWFVNSDCAFDFARPL 

LPNTVYIGGLMEKPKPVPQVSEPSAFSLGFT 


1314 


2664 


A 


10288 


536 


1890 


NVQLAKFSS-lLVm-SCOADPSALAKYVLAL 

VKKDKSBKELKALCIDQLDVFLQKETQIFVEK 

LFDAVNTKSYLPPPEQPSSOSLKVEFFPPQEK 

DDCKEETTKEEEREKKFSRRLNHSPPQSSSRYR 

ENRSIU)ERKKDDRSRKRDYDRNnPPRRDSryRD 

RYNRRRGRSRSYSRSRSRSWSKERLRERDRD 

RSRTRSRSRTRSRERDLVKPKYDLDRTDPLEN 

NYTPVSSVPSISSGHYPVPTI^STITVIAPTHHG 

NNTTESWSEFHEDQVDHNSYVRPPMPKKRC 

RDYDEKOTCMRQDMCPFDHGSDPVVVEDVN 

LPGMQPFPAQPPWEGPPPPQLPPPPPILTPPPV 

NLRPPVPPPGPLPPSLPPVTGPPPPLPPLQPSG 

MDAPPNSATSSVPTWTTGIHHQPPPAPPSLFT 

AirrYDTDGYNPEAPSrTNTSRPMYRHRVHPR 

AKLG 


1315


2665


A 


10293 


447 


1331 


SHPLLSCTEKYSAKLRAAAEAAAEERRTRGA 

GSRGICAGLRSVAPGPEPLKQEEGRREWdSSI 

GTPSPCGSAQAAAAAAAEEATEKIPALRPALL 

WALLALWLCCATPAHALQCRDGYEPCVNEG 

MCVTYHNGTGYCKCPEGFLGEYCQHRDPCE 

KNRCQNGGTCVAQAMLOKATCRCASGFTGE 

IX;QYaIaHPCIFVSRP(XNGGTaiMLSRDTYE 

CrCXJVGFTGRNPKCPGGNLNYQFNGnWYS 

GGSVPPSGTKTSKPAEHNAMGTGSKNFASGT 

LWVMVSGATSTSTSTL 


1316 


2666 


A 


10294 


118 


572 


SLaiESNHKSGDGLSGTQKEAALRALVQRTG 

YSLVQENQQRKYGGPPPGWDAAPPERGCEin 

GKIPRDIJEDELIPLCEKIGKIYEMRMMMDF 

NGNNRGYAFVTFSNKVE/UCNAIKQLNNYEIR 

NGRLLGVCASVDNCRLFVG01PK.TKK 


1317 


2667 


A 


10301 


158 


1956 


UXSCGVLLSGVOPCEGKGFrVLVIQTAVPQ 

DRPTKSSMRSAAKPWNFAIRAOGHGPDRVRP 

LPAASSGMKSSKSSTSLAFESRLSRLKRASSE 

DTLNKPGSTAASGWRUCKTATAGAISELTES 

RLRSGTGAFTTrKRTGIPAPREFSVTVSRERSV 

KQENEGGEKAALESQVRELLAEAKAKDSEIN 

RLRSELKKYKEKRTLNAEGTDALGPNVDGTS 

VSPGDTEPMIRALEEKNKNFQKELSDLEEENR 

VLKEKLIYLEHSPNSEGAASHTGDSSCPTSITQ 

ESSFGSPTGNQLSSDIDEYKiCNIHGNALRTSG 

SSSSDVTKASLSPDASDFEHITAETPSRPLSSTS 

NPFKSSKCSTAGSSPNSVSELSLASLTEKIQKM 

EHfflHSTAEELQATLQELSDQQQMVQELTAE

NEKLVDEKTILETSFHQHRERAEQLSQENEKL 

MNLLQERVKNEEPTTQEGKIIELEQKCTGILE 

QGRFEREKLLNIQQQLTCSLRKVEEENQGAL 

EMKRUCEENEKU^EFLELERHNNNMMAKTL 

EECRVTLEGLKMENGSLKSHLQG 


1318 


2668 


A 


10303 


333 


879 


UliCl'IMAAWQQNDLVFEFASNVMEDERQL 

ODPAIFPAVIVEHVPOADILNSYAGLACVEEP 

NDMrnESSLDVAEEEDDDDDDDITLTSrEASCH 

DGDEnETTEAAEALLNMDSPGPMIJDEKRINN 

NIFSSPEDDMWAPVTHVSVTLDGIPEVMETQ 

QVQHCYADSPGASSPEQPKRKKK 


1319 


2669 


A 


10322 


169 


654 


MEVRMSGSVAVTRAIAVPGLLILLIIATALSL

LIGAKSLPASWLEAFSGTCQSADCnVLDAR 

LPRHAGLLAGGALGLAGALMQTLTRNPLAD 

PGLLGVNAGASFATVLGAALFGYSSAQEQLA 

MAFAGALVASLIVAFTGSQGGGQLSPVRLTL 

AGVXL 


1320 


2670 


A 


10323 


441 


2 


KMNQVAWIGGGQTLGAFLCHQLAAEGYRV 

awdiosokaanvaqeinaeygesmayofg 
adatseqsvlalsrgvdeifgrvdllvysagi 
akaafisdfqlgdfdrslqvnlvgyflcare 
fsrlmirdgiqgrnqinsksde 


1321 


2671 


A 


10332 


1 


453 


rhrtaopgstissrtdsasapaarampceyty 

akltsdcsrpslqwytraqskmrrprlllkd 

oxctllwgvrjolyilklnytteeaamknmh 

yvdpdhvkraqkyaqqvlqkesppkfakts 

mallfehrysvdllpfvqkaptdsea 


1322 




A 
A 






423 


EPSNGPWYSALONtiUUlilLLLGKDIIGTFAAS 

ERKMRAHQVLTFUXFVrrSQASENASTSRGC 

GLDLLPQNVYLCDU5AIWGIVVEAVAGAGA 

LITLLLMULLGRIPFIKEKEKKSPAVLHFLFL 

LGTLG 


1323 


2673 


A 


10334 


52 


426 


SSLGNEDDElLSLAKDrrGMFVASHRKMRAH 
QVLTFUXFVrrSVASENASTSROCOLDLLPQ 
YVSLCDLDAIWGIWBAAAGAGAUTLLLMU 
IXVRLPFFKKKEKKSPVGUIFLFLLGTLGP 


1324 


2674 


A 


10336 


1 


932 


KRLCFPCMQSKIYSYMSPNKCSGMRFPLQEE 

NSVTHHBVKCQGKPLAGIYRKRBEKRNAGN 

AVRSAMKSEEQKIKDARKGPLVPFPNQKSEA 

AEPPKTPPSSCDSTNAAIAKQALKKPIKGKQA 

PRKKAQGKTQQNRKLTDFYPVRRSSRKSKAE 

LQSEERKRIDELIESGKEEGMKIDLIDGKGRG 

VIATKQFSRGDFWEYHGDUEITDAKKREAL 

YAQDPSTGCYMYYFQYLSKTYCVDATRBTN 

RLORLTNHSKCONCQTKLHDIDGVPHLILIAS 

RDIAAGEELLYDYGDRSKASIEAHPWLKH 


1325 


2675 


A 


10338 


3 


870 


PGSnsCSELKGTQCRATAGSRGRRPPMTCWL

RGVTATFGRPAEWPGYLSHLCGRSAAMDLG 

PMRKSYRGDREAFEETHLTSLX»PVKQFAAWF 

EEAVQCPDIGEANAMCLATCTRDGKPSARML 

lAKGFGKDGFRFFTNFESRKGKELDSNPFASL 

VFYWEPLNRQVRVEGPVKKLPEEEAECYFHS 

RPKSSQIGAWSHQSSVIPDREYLRKKNEELE 

QLYQDQEVPKPKSWGGYVLYPQVMEFWQG 

QTh®IJIDRIVFRRQUTX}DSPU3PMrHRGEE 

DWLYERLAP 


1326 


2676 


A 


10344 


2 


984 


ARAAAHCGICRLVRWWRKRRSVMGIQTSPV 
IXASLGVGLVILLGLAVGSYLVRRSRRPQVT 
IXDPNEKYIiRLLDKTTVSHNTKRFRFALPTA
HHTLa,PVGKHIYLSTRIDGSLVIRPYTPVTSD 

EDQGYVDLVDCVYLKGVHPKFPEGGKMSQY 

LDSLKVGDWEFRGPSGLLTYTOKGHFNIQP 

NKKSPPEPRVAKKLOMIAOGTGITPMLQLIRA 

ILKVPEDPTQCFLLFANQTEKDIILREDLEELQ 

ARYPNRFKLWFTLDHPPKDWAYSKGFVTAD 

MIREHLPAPGDDVLVLLCGPPPMVQLACHPN 

UJKLGYSQKMRFTY 


1327 


2677 


A 


10345 


1 


968 


LQSAGEGVTHVLILLESPARPVAAVTQVQRR 

RYHRLSDMSMLAERRRKQKWAVDPQNTAW 

SNDDSKFGQRMLEKMGWSKGKGLGAQEQG 

ATOHDCVQVKNNHLGLGATINNEDNWIAHQ 

DDraQLLAELNTCHGQETroSSDKKEKKSFS 

LEEKSKISKNRVHYMKFTKOKDLSSRSKTDL 

IXaFGKRQSKKTFEODASPSTPEENETTTTSAF 

■nQEYFAKRMAALKNKPQYPVPGSDISETQVE 

RKRGKKKNKEATGKDVESYLQPKAKRHTEG 

KRERAEAQERVAKKKSAPAEEQLRGPCWDQ 

SSKASAQDAGDHVQPA 


1328 


2678 


A 


10346 


173 


439 


GSAAMKVKIKCWNQVATWLWVANDENCGI 
CRMAFNGCCPDCaCVPGDDCPLVWGQCSHCF 
HMHOLKWLHAQQVQQHCPMCRQEWKFKE 


1329 


2679 


A 


10351 


3 


964 


QMEPGNDTQISEFLLLGFSQEPGLQPFUFGLFL 

SNm-VTVLGNLLIILATISDSHLHTPMYFFLSN 

LSFADlCVTSTTIPKMLMNIQTQNKVrrYIACL 

MQMYFFILFAGFENFLLSVMAYDRFVAICHP 

LHYMVIMNPHLCGLLVLASWmSALYSLLQI 

LMWRLSFCTALEIPHFFCELNQVIQLACSDSF 

l^IHMViyFTVALLGGGPLTGILYSYSKHSSIH 

AISSAQGKYKAFSTCASHLSWSLFYGAILGV 

YLSSAATRNSHSSATASVMYTVVITMLNPFI 

YSLRNKDIKRALGlHLLWGTMKGQFFFaCCP 


1330 


2680 


A 


10352


34 


2573 


IPFLKSCX:C(XLFDFPPPPLDQVQEEECEVERV 

TEHGTPKPFRKFDSVAFGESQSEDEQFENDLE 

TDPPNWQQLVSREVLLGLKPCEKRQEVINEL 

FYTERAHVRIUKVLDQVFYQRVSREGILSPSE 

LRKIFShlLEDILQLHIQLNEQMKAVRKRNETS 

VIDQIGEDLLTWFSGPGEEKLKHAAA.TFCSNQ 

PFALEMKSRQKKDSRFQTFVQDAESNPLCRR 

LQLKDIIPTQMQRLTKYPLLLDNlATYreWPT 

ERHCVKKAADHCRQILNYVNQAVKEAENKQ 

RLEDYQRRLDTSSLKLSEYPNVEELRNLDLTK 

Ria4IHEOPLVWKWlIU>Kra>LYTLLLEDILV 

LLQKQDDRLVLRCHSKILASTADSKHrFSPVl 

KLSTVLVRQVATDNKALFVISMSDNGAQIYE 

LVAQTVSEKTVWQDUCRMAASVKEQSTKPI 

PLPQSTPGEGDNDEEDPSKLKEEQHGISVTGL 

QSPDRDLGLESTLISSKPQSHSLSTSGKSEVRD 

U=VAERQFAKEQHTDG1LKEVQEDYQIAIPDS 

HLPVSEERWALDALRNLGLLKQLLVQQLGLT 

EKSVOEDWOHFPRYRTASOGPOTnSVmTJSF 

NKAYHSGEGHMPFRTGTGDIATCYSPRTSTE 

SFAPRDSVGLAPQDSQASNILVMDHMIMTPE 

MFTMEPEQGLDDSGEHFFDAREAHSDENPSE 

GDGAVNKEEKDVNLRISONYIJLDGYDPVQE 

SSTDEEVASSLTLQPMTGIPAVESTHQQQHSP 

QNTHSDGAISPKIVEI'LVQQRWGAMEYSCFEI 

QSPSSCADSQSQIMBYIHKIEADLHttJKKVEE 

SYTILCQRLAGSALTDKHSDKS 



WO 01/57188

PCT/US01/03800




1331 


2681 


A 


10353 


1 


2100 


AVEFAEGALTMAPWPELGDAQPNPDKYLEO 

AAOQQPTAPDKSKETNKTDNTEAPVnCIELLP 

SYSTATLIDEPTEVDDPWNLPTLQDSGIKWSE 

RDTKGKELCFFQGIGRLILLLGFLYFFVCSLDIL 

SSAFQLVGGKMAGQFFSNSSIMSNPLLGLVIG 

VLVTVLVQSSSTSTSIWSMVSSSLLTVRAAIP 

nMGANIGTSmmVALMQVGDRSEFRRAFA 

OATVHDFFNWLSVLVLLPVEVATHYLEIITQL 

rVESFHFKNGEDAPDLLKVTTKPFTBaJVQLDK 

KVlSQIAMNTOEKAraJKSLVKIWacrFTNKTQ 

nmwSTANCTSPSLCWTDGIQNWTMKNVT 

YKENIAKCQHIFVNFHLPDLAVGTILLILSLLV 

LCG(XIMIVKILGSV1,KGQVATVIKKTINTDFP 

FPFAWLTGYLAILVGAGMmVQSSSVFTSAL 

TPLIGlQmiERAYPLTLGSNIGTTTTAILAAL 

ASPGNALRSSLQIALCHFFFNISGnXWYPIPFT 

RLPIRMAKGLGNISAKYRWAVFYLnFFFLIP 

LTVFGLSLAGWRVLVGVGVPWFniLVLCXK, 

IXQSRCPRVIJKKLQNWNFLPLWMRSLKPW 

DAWSKFTOCFQMRCCCCCRVCXaiACCLLC 

GCPKCCRCSKCCEDLEEAQEGQDVPVKAPET 

FDNmSREAQGEVPASDSKTECTAL 


1332 


2682 


A 


10354 


30 


1377 


SQQGSQPHRQGPPSLLTAPHSLDLPAJLPPGPR 
GSQGKLRRVLVPMSVKPSWGPGPSEGVTAVP 
TSDLGEfflNWTELLDLFNHILSECHVELSQST 

krwlfalylamfwglvenllvicvnwrg 

sgraglmnlyilnmaiadlgivlslpvwmle 

vtldytwlwgsfsoifthyfyfvnmyssiff 

lvclsvdryvtltsaspswqryqhrvrram 

cagiwvlsailplpewhiqlvegpepmelfm 

apfetystwalavalsttilctllpfplervfn 

vltacrlrqpoqpksrrhclllcayvavfv 

mcwlpyhvtlllltlhgthislhchlvhlly 

ffydviix:fsmlhcvinpilynflsphfr,grll 

NAWHYLPKDQTKAGTCASSSSCSTQHSnrr 

KGDSQPAAAAPHPEPSLSFQAHHLLPNTSPISP 

TQPLTPS 


1333 


2683 


A 


10358


2 


884 


AAGAGADGREPASERASRAEPPAVAMGQND 

LM(j 1 AJbUr ADQFlJtVTKQYIJ'HVARLCXlST 

FLEDGIRMWFQWSEQRDYIDTTWNCGYLLA 

SSFVFLNllXJQLTGCVLVLSRNFVQYACFGLF 

GHALQTUYSILWDLKFIJvIFNLALGGGLLIX 

LAESRSEOKSMFAOVPTMRESSPKQYMQLGO 

RVLLVlivlFMTLLHFDASFFSIVQNlVGTALMI 

LVAIGFKTKLAALTLWWLFAINVYFNAFWT 

IPVYKPMHDFLKYDFFQTMSVIGGLLLWAL 

GPGGVSMDEKKKEW 


1334 


2684 


A 


10367 


59 


1562 


QAWSLQVALSPtWPASFSNSFAAAVPQLLFP 

ELPLPHVPGQESAKRRSARRFLIMSEL'rKEIivl 

ELVWGTKSSPGLSDTIFCRWTQGFVFSESEGS 

ALEQFEGGPCAVIAPVQAELLKKLLFSSEKSS 

WRDCSQEEQKELLCHTLCDILESACCDHSGS 

YCLVSWLRGKTTEETASrSGSPAESSCQVEHS 

SALAVEELGFERFHAUQKRSFRSU»ELKDAV 

LDQYSMWGNKFGVLLFLYSVLLTKGIENIKN 

EIEDASEPUDPVYGHGSQSLINUXTGHAVSN 

VWDGDRECSGN4KLLGIHEQAAVGFLTLMEA 

LRYCKVGSYLKISKIPYLDCLASETHLTVFFA 

KDMALVAPEAPSEQARRVFQTYDPEDNQFIP 

DSIXEDVMKAU5LVSDPEYINLMKNKLDPEO

LGmXQPFLQEFFPDQGSSGPESFTVYHYNGL 
KQSNYNEKVMYVEGTAWMGFEDPMLQTD 
DlTKRCLQTKWPYimXWTTDRSPSLN 


1335 


2685 


A 


10375 


82 


2929 


TRTKRRLGREKAMASPPRGWGCGELLLPFML 

LGTLCEPGSGQIRYSMPEEUDKGSFVGNIAKD 

LGLEPQELAERGVRIVSRGRTQLFALNPRSGS 

LVTAGRIDREELCAQSPLCVVNFNILVENKM 

KIYGVEVEIIDINDNFPRFRDEELKVKYNENA 

AAGTRLVLPFARDADVGVNSLRSYQLSSNLH 

FSLDWSGTDGQKYPELVLEQPLDREKETVH 

DIXLTALDQGDPVLSGTTHIRVTVLDANDNA 

PLFTPSEYSVSVPENIPVGTia-LMLTATDPDE 

GINGKLTYSFRNEEEKISETFQLDSNLGEISTL 

QSLDYEESRFYLMEWAQDGGALVASAKW 

VTVQDVNDNAPEVILTSLTSSISEDCLPGTVU 

LFSVHDGDSGENOEIACSIPKNLPFKLEKSVD 

NYYHLLTTRDLDREETSDYNTIXTVNIDHGTP 

PLSTESHIPLKVADVNDNPPNFPQASYSTSVT 

ENNPRGVSIFSVTAHDPDSGDNARVTYSLAE 

DTFQGAPLSSYVSINSDrGVLYALROTDYEQL 

RDLQLWVTASDSCTIPPLSSNVSLSLFVLDQN 

DNTPEILYPALPTDGSTGVELAPRSAEPOYLV 

TKWAVDKDSGQNAWLSYRLUCASEPGLFA 

VGLHTGEVRTARALLDRDALKQSLWAVED 

HGQPPLSATFTVTVAVADRIPDILADLGSIKTP 

IDPEDLDLTLYLWAVAAVSCVFLAFVrVLLV 

LRLRRWHKSRLLQAEGSRLAGVPASHFVGV 

DGVRAFLQTYSHEVSLTADSRKSHLIFPQPNY 

ADTLLSEESCEKSEPLLMSDICVDANKEERRV 

QQAPPNTDWRFSQAQRPGTSGSQNGDDTGT 

WPNNQFDTEMLQAMILASASEAADGSSTLGG 

GAGTMGLSARYGPQFTLQHVLQGELGSDYR 

QNVYIPGSNATLTNAAGKRDGKAPAGGNGN 

KKKSGKKBKK 


1336 


2686 


A 


10379 


1 


337 


RPRRRQPSFSCRVLVLEDPPCFRFTOSMNQEK 
LAKLQAQVRIGGKGTARRKKKWHRTATAD 
DKKLQSSLKKLAVNNIAGIEEVNNIIKDDGTVI 
HFNNPKVQASLSANTFAITGHAEAKPITEMLP 
GILSQLGADSLTSUUCLAEQFPRQVLDSKAPK 
PEDIDEEDDDVPDLVENFDEASKNEAN 


1337 


2687 


A 


10380 


1 


1263 


IPGSTISWSPAAARGLSVCaiCeRLHPASAMDL 

FGDLPEPERSPRPAAGKEAQKGPLLFDDLfPA 

SSTDSGSGGPLLFDDLPPASSGDSGSLATSISQ 

MVKTEGKGAKRKTSEEEKNGSEELVEKKYC 

KASSVIFGUCGYVAERKGEREEMQDAHVILN 

DITEECRPPSSUTRVSYFAVFDGHGGIRASKF 

AAQNLHQNLIRKFPKODVISVEKTVKRCLLD 

TFKHTDEEFIXQASSQKPAWKDGSTATCVLA 

VDNILYIANLGDSRAILCRYNEESQKHAALSL 

SKEHNPTQYEERMRIQKAGGNVRDGRVLGV 

LEVSRSIODGQYKRCGVTSVPDIRRCQLTPND 

RFILLACDQLFKVFTPEEAVNFILSCLEDEKIQ 

TREGKSAADARYEAACNRLANKAVQRGSAD 

NVTVMWRIGH 


1338 


2688 


A 


10385 


3 


589 


GPSQSMAAGELEGGKPLSGLLNALAQDTFHG 

YPOrraELLRSQLYPEVPPEEFRPFLAKMRGIL 

KSIASADMDFNQLEAFLTAQTKKQOGITSDQ 

AAVISKFWKSHKTKIRESLMNQSRWNS(2LRG 

LSWRVIMKSQSRHSAQMrPVAIIELELGKYG 

QESEFLCLEFDEVKVNQILKTLSEVEESISTLIS

QPN 


1339 


2689 


A 


10386


50 


390 


LGAMAKHHPDLIFCRKQAGVAIGRLCEKCDO 
KCVICDSYVRPCTLVRICDECa^GSYQGRCVI 
CGGPGVSDAYYCKECTIQEKDRDGCPKIVNL 
GSSKTDLFYERKKYGFXKR 


1340 


2690 


A 


10388


113 


3472 


SQLRKGASATHSSPSRTDCIAQMMDIYVCLK 

ItfSWMVDNKIlMRTASNFQWLLSTFILLYLM 

NQVNSQKKGAPHDUCCVTNNLQVWNCSWK 

APSGTGRGTDYEVCIENRSRSCYQLEKTSIKIP 

ALSHODYEmNSLHDFGSSrSKFrLNEQNVSL 

IPDTPEIU«-SADPSTSTLYLKWNDRGSVFPHR 

SNVIWEIKNOJUCESMELVKLVTHNTTLNGKD 

TLHHWSWASDMPLECAIHFVEIRCYIDNLHFS 

GLEEWSDWSPVKNISWIPDSQTKVFPQDKVIL 

VGSDITFCCVSQEKVLSAIJGHTNCPLIHLDGE 

NVAIKIRNISVSASSGTNWFTTBDNIFGTVIF 

AGYPPDTPQQLNCETHDLKEnCSWNPGRVTA 

LVGPRATSYTLVESFSGKYVRLKRAEAPTNES 

YQLLFQMLPNQEIYNFTLNAHNPLGRSQSm 

VNTTEKVYPHTPTSFKVKDINSTAVKLSWHLP 

GNFAKINFLCEIEIKKSNSVQEQRNVnKGVE 

NSSYLVALDKLNPYTLYTFRIRCSTETFWKW 

SKWSNKKQHLTTEASPSKGPDTWREWSSDG 

KNLIIYWKPLPINEANGKILSYNVSCSSDEETQ 

SLSEIPDPQHKAEIRlDKNDyilSWAKNSVGS 

SPPSKIASMEIPNDDLKIEQWGMGKGILLTW 

HYDPNMTCDYVIKWCNSSRSEPCLMDWRKV 

PSNSTETVIESDEFRPGlRYNFFLYGCRNQGy 

QLLRSMIGYIEELAPIVAFNFTVEDTSADSILV 

KWEDIPVEELRGFLRGYLFYFGKGERDTSKM 

RVLESGRSDIKVKNITDISQKTLRIADLQGKTS 

YHLVIJIAYTDGGVGPEKSMYVVTKENSVGL 

nAILIPVAVAVIVGWTSILCYRKREWIKETrY 

PDIPNPENCKALQFQKSVCEOSSALKTLEMNP 

CTPNNVBVLETRSAFPKIEDTEIVSPVAERPB«J 

RSDAKPENHWESYCyPHEEEIPNPAADETGG 

TAQVIYIDVQSMYQPQAKFEEEQENDPVGGA 

GYKPQMHLPmSTVBDIAAEEDLDKTAGYRP 

QANVNTWNLVSPDSPRSIDSNSEIVSFGSPCSI 

NSRQFLIPPKDEDSPKSNOOOWSFTNFFQNKP 

ND 


1341 


2691 


A 


10392


1 


5057 


MUTKHLSATKPKKSWAPNLYELDSDLTXEP 

DVnGEGPTDSEFFHQRFRNUYVEFVGPRKIL 

KUWIXXDWIXJPETRTKEEnELLVLEQYLTn 

PEKlja>WVRAKKPENCEKLVnXENYKEMY 

QPEGESLHOVLWSAGLRCPLGLSASTLLTW 

SGLDNSLSWAAVGMSCVLWDIELHHDFLGV 

ATKSVSTHAQGDAAQGLGGTIVRMWARDSN 

LATGVLLDDNNSDVTSDDDMTRNRRESSPPH 

SYHSFSGDRDWDRRGRSRDTHPRDRWSHTR 

NPRSRMPPRDLSLPWAKTSFeklDREDDRDS 

RAYESRSQDAESYQNWDLAEDRKPHNTIQD 

NMENYRKIXSLGVQtAEDDGHSHNlTQGHSS 

RSKRSAYPSTSRGLKTMPEAKKSTHRRGICED 

ESSHGVIMEKFIKDVSRSSKSGRARESSDRSQ 

RFPRMSDDNWKDISLNKRESVIQQRVYEONA 

FRGOFRFNSTLVSRKRVLERKRRYHFDTDQK 

GSMDQKGCPRKKPFECGSEMRKAMSVSSLS 

SLSSPSFTESQPIDFGAMPYVCDECGRSFSVIS 

EFVEHQIMHTRENLYEYGESFIHSVAVSEVQK

SQVGGKRFECKDCGETFNKSAALAEHRKIHA 

RGYLVECKNQECEEAFMPSPTFSELQtOYQK 

DKFYECRVCKETFLHSSALISIQKIHFODDKD 

NEREHERERERERGETFRPSPALNEFQKMYG 

KEKMYECKVCGETTLHSSSLKEHQKIHniGN 

PFENKGKVCEETFIPGQSLKRRQKTyNKEKLC 

DFTDGRDAFMQSSELSEHQKIHSRKNLFEGR 

GYEKSVmSOPFTESQKSHnXRPLESDEDEKA 

FTtSSNPYENQKIPTKENVYEAKSYERSVIHSL 

ASVEAQKSHSVAGPSKPKVMAESTIQSFDAIN 

HQRVRAGGNTSEGREYSRSVIHSLVASKPPRS 

HNGNELVESNEKGESSIYISDLMDKRQHPAR 

ENPCEGGSKNRNYEDSVIQSVFRAKPQKSVF 

GEGSGEFKKDGH'SVPSSNVREYQKARAKKK 

YIEHRSNETSVIHSLFI'GEQTFRPRGMLYECQ 

ECGECFAHSSDLTBHQKIHDHEKPSGSRNYE 

WSVIRSLAPTDPQTSYAQEQYAKEQARNKCK 

DFRQFFATSEDLNTNQKIYDQEKSHGEESQGE 

NTDGEETHSEETHGQETIEDPVIQGSDMEDPQ 

KDDPDDKIYECEDCOLGFVDLTDLTDHQKVH 

SRKCLVDSREYTHSVMTHSISEYQRDYTGEQ 

LYECPKCGESFIHSSFLFEHQRIHEQDQLYSM 

KOCDDGFIAIXPMKPRRNRAAERNPALAGSA 

IROXCGQGFIHSSALNEHMRUIREDDLLEQS 

QMAEEAIlPGLALTEFQRSQTffiRLFECAVCG 

ESFVNPAELADHVTVHKNEPYEYGSSYTHTS 

rLlti'JLJtOAlrfrECKDCXJKSFiHSI 

LHLEEEEEDEAAAAAAAAAQEVEANVHVPQ 

WLRIQGLNVEAAEPEVEAAEPEVEAAEPEV 

EAAEPNGEAEGPDGEAAEPIGEAGQPNGEAB 

QPNGDADEPDGAGIEDPEERAEEPEGKAEEPE 

GDADEPDGVGEEDPEBGEDQEIQVEEPYYDC 

HECraTFrSSTAFSEHLKniAS\inFEPANAFG 

ECSGYIERASTSTG0ANQADEKYFKCDVCX3Q 

LFNDHLSLARHQNTOTG 


1342 


2692 


A 


10393 


2 


1350 


GRPRSSSDNRNFLRERAGLSSAAVQTRIGNSA 

ASRRSPAARPPVPAPPALPRGRPGTEGSTSLS 

APAVLWAVAWWWSAVAWAMANYIHV 

PPGSPEVPKLNVTVQDQEmRCREGALSLLQ 

HUU>HWDPQEVTLQUTT)GrrNKLIGCYVGN 

TMEDWLVRIYGNKTELLVDRDEEVKSFRVL 

OAHfST'APnT YpnTMMfif /''viTnfrimjAT rkDvn 

VCNPAIFRLIARQLAKIHAIHAHNGWIPKSNL 

WLKMGKYFSLIPTGFADEDINKRFLSDIPSSQI 

LQEEMTWMKmSNUJSPVVLCHNDLLCKNn 

YNEKQGDVQFIDYEYSGYNYLAYDIGNHFNE 

FAGVSDVDYSLYPDREUJSQWLRAYLEAYK 

EFKGFGTEVTH^EVEILFIQVNQFALASHFFW 

GLWALIQAKYSTIEFDFLGYAIVRFNQYFKM 

KPEVTALKVPE 


1343 


2693 


A 


10394 


102 


839 


PEAQTSAVLAREKGHLPTMRHEAPMQMASA 

QDARYGQKDSSDQNFDYMFKLUrGNSSVfflC 

TSFLFRYADDSFTSAFVSTVGIDFKVKTVFKN 

EKRIKLQIWDTAGQERYRTnTAYYRGAMGn 

LMYDITNEESFNAVQDWSTQIKTYSWDNAQ 

VILVGNKCDMEDERVISTERGQHLGEQLGFE 

FFETSAKDNlNVKQTFERLVDnCDKMSESLET 

DPAITAAKQNTRLKETPPPPQPNCAC


1344


2694 


A 


10395 


2 


4136 


DRPPWNSRVDDFVTNLIHLSSKGfflSPAKDTS 
LQQRTPAEMSPVLHFYVRPSGHEOAASGHTR 



RKIXJGKLPKLgGVb-lELCYNVNWTAEALPSA 

EETKKLMWLFGCPLLLDDVARESWLLPGSN 

DLLLEVOPRLNFSTPTSTNIVSVCRATGLOPV 

DRVBTTRRYRLSFAHPPSAEVEAIALATLHDR 

MTEQHFPHFIQSFSPESMPEPLNGFINILGEGR 

LAIJEKANQELGIALDSWDLDFYTKRFQELQR 

NPSTVEAFDLAQSNSEHSRHWFFKGQLHVDG 

QKLVHSLFESIMSTQESSNPNNVLKFCDNSSA 

IQGKEVRFLRPEDPTRPSRFQQQQGLRHWFT 

AETHNFPTGVCPFSQATTGTGGRIRDVQCrG 

RGAHWAGTAGYCFGNLHIPGYNLPWEDLSF 

QYPGNFARPLEVAIEASNGASDYGNKFGEPV 

LAGFARSLGLQLPDGQRREWKPIMFSGGIGS 

MEADfflSKEAPEPGMEWKVGGPVYRIGVGG 

GAASSVQVQGDNTSDLDFOAVQRGDPEMEQ 

KMNRVIRACVEAPKGNPICSLHDQGAGCJNG 

NVLKELSDPAGAHYTSRFQLGDPTLNALEIW 

GAEYQESNAIXLRSPNRDELTHVSARBRCPA 

CFVGTITGDRRIVLVDDRECPVRRNGQGDAP 

PTPPPTPVDLELEWVLOKMPRKEFFLQRKPP 

MLQPLALPPOLSVHQALERVLRLPAVASKRY 

LTNKVDRSVGGLVAQQQCVGPLQTPLADVA 

WALSHEEUGAATALGEQPVKSLLDPKVAA 

RLAVAEALTNLVFALVTDLRDVKCSGNWM 

WAAKLPGEGAALADACEAMVAVMAALOVA 

VDGGKDSLSMAARVGTBTVRAI'GSLVISAYA 

VCPDITATVrPDLKHPEGRGHLLYVALSPGQ 

HRLGGTALAQCFSQLGEHPPDLDLPENLVRA 

FSrrQGLLKDRLLCSGHDVSDGGLVTCLLEM 

AFAGNCGLQVDVPVPRVDVLSVLFAEEPGLV 

LEVQEPDLAQVLKRYRDACHJICLELGHrGE 

AGPHAMVRVSVNGAWLEEPVOELRALWEE 

TSFQLDRLQAEPRCVAEEHRGLRERMGPSYC 

LPFITPKASVPREPGGPSPRVAILREEGSNGDR 

FRGVAFVGGFSYADVLOSAKGWAAAVTTHP 

RAGAEUUUTUCRPDTFSLGVCNGCQLLAJLLG 

WVGGDPNEDAAEMGPDSQPARPGLIIRHNL 

SORYESRWASVRVGPGPAIMLRQMEOAVLP 

VWSAHGEGYVAFSSPELQAQIEARGLAPLHW 

ADDDGNPTEQYPLNPNGSPGGVAGICSCDGR 

HLAVMPHPERAVRPWQWAWRPPPFDTLTTS 

PWLQLFINARNWTLEGSC 


1345


2695


A 


10396 


65


642 


GVROFWAGTMASRAGPRAAGTDGSDFQHRE 

RVAKfllYQMSVTLKYEIKKLIYVHLVIWLLLV 

AKMSVGHLRLLSHDQVAMPYQWEYPYLLSI 

LPSLLGLLSFPRNNISYLVL^NflSMGLFSIAPLI 

YGSMEMFPAAQQLYRHGKAYRFLFGFSAVSI 

MYLVLVLAVQVHAWQLYYSKKLLDSWFTST 

QEKKHK 


1346 


2696 


A 


10398 


1 


718 


DDFVRCGPQSAAMGASARLLRAVIMGAPGS

GKGTVSSRTrrHFEUCHLSSGDIXRDNMLRGT 

EIGVLAKAFroQGKLBPDDVMTRLALHELKNL 

1XJYSWLUX3FPRTLPQAEALDRAYQIDTV1NL 

NVPFEVDCQRLTARWIHPASORVYNIEFNPPK 

TVGIDDLTOEPUQREDDKPETVIKRLKAYED 

QTKPVLEYYQKKGVUETFSGTETNKIWPYVY 

AFLQTKVPQRSQKASVTP 


1347 


2697 


A 


10402 


153 


1969 


KHRQENNALDMAPEIHMTGPMCLIENTNGEL 
VANPEALKILSAITQPVWVAIVGLYRTGKSY
LMNKLAGKNKOFSLGSTVKSHTKGIWMWCV 

PHPKKPEHTLVLLDTEGLGDVKKGDNQNDS 

WmiAVLLSSTLVYNSMGTINQQAMDQLYY 

VTELTEDURSKSSPDENENEDSADFVSFFPDFV 

WTLRDFSLDLEADGQPLTPDEYLEYSLKLTQ 

GTSQKDKNFNLPRLCIRKFFPKKXCFVFDLPI 

HRRKLAQLEKLQDEELDPEFVQQVADFCSYI 

FSNSKTKTLSGGDCVNGPRLESLVLTYINAISR 

GDLPCmNAVLALAQIENSAAVQKAIAHYD 

YMKNSFKDVDHLFQKKLAAQLDKKRDDFCK 

QNQEASSDRCSALLQVIFSPLEEEVKAGIYSK 

PGGYCLFIQKLQDLEKKYYEBPRKGIQAEEIL 

QTYLKSKESVTDAILQTDQILTEKEKEIEVEC 

VKAESAQASAKMVEHvlQIKYQQMMEEKEKS 

YQEHVKQLTEKMERERAQLLEEQEKTLTSKL 

QEQARVLKERCQGESTQLQNEIQKLQKTLKK 

KTKRYMSHKLKl 


1348 


2698 


A 


10404 


5 


892 


TQLPAPLSGVLSRLQLGSGAPLLTWVQETAG 

VAOOArKKKlrV IMWKLLAKAoAjrl-LRVrLS 

DSWALLPASAGVKTLLFVPSFEDVSIPEKPKL 

RFDERAPLVPKVRREPKNLSDIRGPSTEATEFT 

EGNEAILALGGGYLHWGHFEMMRLTINRSM 

DPKNMFAIWRVPAPFKPmuCSVGHRMGGGK 

GAIDHYVTPVKAGRLWEMGGRCEFEEVQG 

FLDQVAHKLPFAAKAVSRGTLEKMRKDQEE 

RERNNQNPWTFERlATANMLCaRKVLSPYDL 

THKGKYWGKFYMPKRV 


1349 


2699 


A 


10409 


59 


1184 


LRRNCSALGGLFQTIISDMKGSYPVWEDFINK 
AGKLQSQLRTTWAAAAFLDAFQKVADMAT 
NTOGGTREIOSALTRMCMRHRSIEAKLRQFSS 

EYKKARQEKKKSSDTLKLQKKAKKGRODIQ 

PQLDSALQDVNDKYLLLEETEKQAVRKAUE 

ERGRFCTTTSMLRPVDEEEISMLGEITHLQTISE 

DLKSLTMDPHKLPSSSEQVELDLKGSDYSWS 

YQTPPSSPSTTMSRKSSVCSSLNSVNSSDSRSS 

GSHSHSPSSHYRYRSSNLAQQAPVRLSSVSSH 

DSGnSQDAFQSKSPSFMPPEAPNQRRKHOlE 

PDPNGGGPTTASGPPAAAEEAQRPRSM 


1350


2700 


A 


10410 


511


958 


AGRGGPGKPVSWSSGPGSPGQTQRRSWVKST 

RGHSSLLPPSQDFVAGLSVILRGTVDDRLNW 

AFNLYDUSlKDGCmCEEMLDIMKSlYDMMG 

KYTYPALREEAPREHVESFFQKMDKNKDGV 

VTIEEFIESCXJKDENIMRSMQLFDNVI 
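
For readers working with the table above, the following minimal Python sketch illustrates how the column legend and the two nucleotide location columns might be checked against a listed peptide. It is not part of the original disclosure: the names LEGEND, unexpected_symbols and span_in_codons are hypothetical, the locations are assumed to be inclusive, and a beginning location greater than the end location is assumed (the text does not say so) to indicate the opposite strand.

```python
# Illustrative sketch only; all helper names here are hypothetical.

# One-letter symbols exactly as given in the table's column legend.
LEGEND = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid",
    "E": "Glutamic Acid", "F": "Phenylalanine", "G": "Glycine",
    "H": "Histidine", "I": "Isoleucine", "K": "Lysine", "L": "Leucine",
    "M": "Methionine", "N": "Asparagine", "P": "Proline",
    "Q": "Glutamine", "R": "Arginine", "S": "Serine", "T": "Threonine",
    "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine", "X": "Unknown",
    "*": "Stop codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def unexpected_symbols(peptide):
    """Return (position, character) pairs for symbols outside the legend.

    Whitespace (spaces, line breaks) is ignored; positions are 1-based
    and count only the non-whitespace characters.
    """
    compact = "".join(peptide.split())
    return [(i, ch) for i, ch in enumerate(compact, start=1)
            if ch not in LEGEND]


def span_in_codons(begin, end):
    """Number of codons covered by two inclusive nucleotide locations.

    Assumption: when begin > end the span is simply read in the other
    direction, so only the absolute distance matters.
    """
    return (abs(end - begin) + 1) / 3
```

Under these assumptions, the row listing locations 970 and 1524 spans 185 codons, which appears to agree with the 185 residues printed for that row. Characters reported by unexpected_symbols are more likely transcription artifacts than residues and may merit checking against the formal sequence listing.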
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WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a nucleotide sequence selected from the group
consisting of SEQ ID NO: 1-1350, a mature protein coding portion of SEQ ID NO: 1-1350, an
active domain of SEQ ID NO: 1-1350, and complementary sequences thereof.

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the
complementary sequences.

6. A vector comprising the polynucleotide of claim 1.

7. An expression vector comprising the polynucleotide of claim 1.

8. A host cell genetically engineered to comprise the polynucleotide of claim 1.

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively
associated with a regulatory sequence that modulates expression of the polynucleotide in the host
cell.

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of:

(a) a polypeptide encoded by any one of the polynucleotides of claim 1; and

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent conditions
with any one of SEQ ID NO: 1-1350.

11. A composition comprising the polypeptide of claim 10 and a carrier.

12. An antibody directed against the polypeptide of claim 10.

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising:

a) contacting the sample with a compound that binds to and forms a complex
with the polynucleotide of claim 1 for a period sufficient to form the complex; and

b) detecting the complex, so that if a complex is detected, the polynucleotide
of claim 1 is detected.

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with
nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions;

b) amplifying a product comprising at least a portion of the polynucleotide of
claim 1; and

c) detecting said product and thereby the polynucleotide of claim 1 in the
sample.



15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising:

a) contacting the sample with a compound that binds to and forms a complex
with the polypeptide under conditions and for a period sufficient to form the complex; and

b) detecting formation of the complex, so that if a complex formation is
detected, the polypeptide of claim 10 is detected.

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10 under
conditions sufficient to form a polypeptide/compound complex; and

b) detecting the complex, so that if the polypeptide/compound complex is
detected, a compound that binds to the polypeptide of claim 10 is identified.

18. A method for identifying a compound that binds to the polypeptide of claim 10,
comprising: 
a) contacting the compound with the polypeptide of claim 10, in a cell, under
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 
of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising,

a) culturing a host cell comprising a polynucleotide sequence selected from
the group consisting of a polynucleotide sequence of SEQ ID NO: 1-1350, a mature protein
coding portion of SEQ ID NO: 1-1350, an active domain of SEQ ID NO: 1-1350,
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent
conditions to SEQ ID NO: 1-1350, under conditions sufficient to express the polypeptide in said
cell; and

b) isolating the polypeptide from the cell culture or cells of step (a).

20. An isolated polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO: 1351-2700, the mature protein portion thereof, or the active domain
thereof. 



21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide array.

22. A collection of polynucleotides, wherein the collection comprises the sequence
information of at least one of SEQ ID NO: 1-1350. 

23. The collection of claim 22, wherein the collection is provided on a nucleic acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any one of the
polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any one of the 
polynucleotides in the collection.



26. The collection of claim 22, wherein the collection is provided in a computer-readable
format. 
27. A method of treatment comprising administering to a mammalian subject in need thereof
a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 and a
pharmaceutically acceptable carrier.

28. A method of treatment comprising administering to a mammalian subject in need thereof 
a therapeutic amount of a composition comprising an antibody that specifically binds to a 
polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier.
Pages 340 to 1963 of this application contain amino acid sequence listings.
They can be obtained at the address given below. 




World Intellectual Property Organization 
34, chemin des Colombettes 
CH-1211 Genève 20